(4) Introduction


Previous studies have found that there is a strong correlation between mammographic
breast density and the risk of breast cancer. Mammographic breast density has been used by
researchers in many studies to estimate the breast cancer risk associated with epidemiological
factors, monitor the effects of preventive treatments such as tamoxifen or dietary interventions,
monitor the breast cancer risk of hormone replacement therapy, and investigate factors affecting
mammographic sensitivity and cancer prognosis. However, most studies used the Breast Imaging
Reporting and Data System (BI-RADS) density rating as a measure of mammographic breast
density, which introduces large inter- and intraobserver variations and may reduce the
sensitivity of the analysis.

The  goal  of  this  proposed  project  is  to  develop  a  fully  automated  technique  to  assist 
radiologists in estimating mammographic breast density. We hypothesize that the computerized
technique  can  accurately  and  efficiently  segment  the  dense  area  on  digitized  or  digital 
mammograms,  thereby  eliminating  inter-  and  intra-observer  variations.  The  dense  area  as  a 
percentage  of  total  breast  area  thus  estimated  will  be  more  consistent  and  reproducible  than 
radiologists'  subjective  BI-RADS  rating.  To  accomplish  this  goal,  we  will  (1)  collect  a  large 
database  of  mammograms,  including  digitized  film  mammograms  and  digital  mammograms,  for 
training  and  testing  the  dense  area  segmentation  program;  (2)  evaluate  the  correlation  between 
the  radiologists'  breast  density  classification  based  on  BI-RADS  lexicon  and  the  percent  breast 
dense  area;  (3)  study  the  correlation  of  percent  breast  dense  area  between  different  views  of  the 
same  breast  and  between  the  same  view  of  the  two  breasts;  and  (4)  investigate  the  correlation 
between  the  percent  breast  dense  area  estimated  from  mammograms  and  the  volumetric  dense 
breast  tissue  estimated  from  a  data  set  of  magnetic  resonance  (MR)  breast  images.  These 
comparisons  will  provide  important  information  regarding  the  consistency  of  the  BI-RADS 
rating  with  the  measured  percent  breast  dense  area,  the  appropriate  measure  of  %  dense  area 
from different mammographic views, and the usefulness of using the percent breast dense area
on  mammograms  as  an  indicator  of  volumetric  breast  tissue  density. 

It  is  expected  that  this  project  will  produce  a  fully  automated  and  effective  tool  for 
analysis of mammographic breast density, which can be applied to routinely acquired
mammograms  without  special  calibrations.  This  will  facilitate  studies  of  various  factors 
associated with breast cancer risk and mammographic sensitivity, and monitoring the effects of
interventional  or  preventive  strategies.  The  image  analysis  tool  will  therefore  contribute  to  the 
understanding  of  the  relationship  of  density  to  breast  cancer  risk,  detection,  prognosis,  and  to  the 
prevention  and  treatment  of  breast  cancer. 
(5) Body


This  is  the  final  report  of  the  project.  We  have  described  in  detail  the  results  of  our 
studies  in  the  past  annual  progress  reports.  The  investigations  conducted  in  this  project  are 
summarized  in  the  following. 

(A) Collection of a Database of Full Field Digital Mammograms (DMs) and Digitized Film Mammograms (DFMs)

With IRB approval, we have collected a database of full field direct digital mammograms
(DMs) and a database of digitized screen-film mammograms (DFMs). The database of DMs contained
about  290  cases  with  over  580  pairs  of  DMs.  The  database  of  DFMs  included  about  380  cases 
with  over  760  DFMs.  The  data  sets  were  used  for  the  development  of  the  breast  density  analysis 
tools  for  DMs  and  DFMs. 

(B)  Automated  Breast  Density  Analysis  System  for  Digitized  Mammograms 

We developed an automated image analysis tool, the Mammographic Density ESTimator
(MDEST), to assist radiologists in estimating breast density in terms of percent dense area on
mammograms. MDEST performs dynamic range compression, breast boundary tracking,
pectoral muscle segmentation for the MLO view, and automatic thresholding based on gray level
histogram analysis, and then calculates the percent dense area on a mammogram. In our study, we
found that the correlation between the computer-estimated percent dense area and radiologists’
manual segmentation was 0.94 and 0.91, respectively, for CC and MLO views, with a mean bias
of less than 2%. These results indicated that the automated image analysis tool can be an
efficient and reproducible method for breast density segmentation. (Publications J1, J2, J3, P1,
P2, A1, A2, A3, A6, A8)
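
The final step of the pipeline, turning a gray level threshold into a percent dense area, can be sketched as follows. This is only an illustration and not the MDEST code: the function name, the toy image, and the assumption that dense tissue has the larger pixel values inside the breast mask are ours.

```python
import numpy as np

def percent_dense_area(image, breast_mask, threshold):
    """Percent of breast pixels whose gray level is at or above `threshold`.

    Assumes dense tissue appears with larger pixel values than fatty tissue.
    """
    breast_mask = np.asarray(breast_mask, dtype=bool)
    breast_pixels = np.asarray(image)[breast_mask]
    if breast_pixels.size == 0:
        raise ValueError("breast mask is empty")
    dense = breast_pixels >= threshold
    return 100.0 * dense.sum() / breast_pixels.size

# Toy 3 x 4 image: zeros stand in for background, the rest for breast tissue.
image = np.array([[0, 10, 200, 220],
                  [0, 20, 210, 30],
                  [0, 15, 25, 35]])
mask = image > 0
print(percent_dense_area(image, mask, threshold=100))
```

The breast mask would come from the boundary tracking and pectoral muscle steps, and the threshold from the histogram analysis; both are simply given here.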

(C) Correlation between Percent Dense Area and Percent Volumetric Fibroglandular Tissue

We performed a comparison between fibroglandular tissue volume segmented from
3-dimensional MR images and the percent dense area estimated on corresponding DFMs. The
correlation of manually segmented percent dense area of the CC and MLO views by radiologists
with the percent volumetric fibroglandular tissue on MR images was found to be 0.91 and 0.91,
respectively. The correlation of automated MDEST segmented percent dense area of the CC and
MLO views with the percent volumetric fibroglandular tissue on MR images was found to be
0.91 and 0.89, respectively. The mean differences between the percent dense area and the MR
percent volumetric density ranged from 3% to 6%. The high correlation indicated that the percent
dense area on mammograms can be used as a surrogate for monitoring breast density changes.
(Publications J2, P1, A1, A3)
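
The agreement measures quoted in this report (correlation coefficient, mean difference or bias, and RMS error) can be computed along the following lines. This is a minimal sketch assuming the correlation is the Pearson coefficient, which the report does not state explicitly; the function name and the numbers are illustrative, not study data.

```python
import numpy as np

def agreement_stats(estimate, reference):
    """Pearson correlation, mean difference, and RMS difference of paired values."""
    estimate = np.asarray(estimate, dtype=float)
    reference = np.asarray(reference, dtype=float)
    diff = estimate - reference
    return {
        "r": float(np.corrcoef(estimate, reference)[0, 1]),
        "mean_diff": float(diff.mean()),
        "rms": float(np.sqrt(np.mean(diff ** 2))),
    }

# Hypothetical percent dense area (mammogram) vs. percent volumetric density (MR).
mammo = [12.0, 25.0, 31.0, 48.0, 60.0]
mr = [9.0, 20.0, 29.0, 41.0, 57.0]
print(agreement_stats(mammo, mr))
```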

(D) Correlation between Breast Density Estimated on Digitized Screen-Film Mammograms and Full Field Digital Mammograms

We  studied  the  correlation  between  the  percent  dense  area  estimated  on  DFMs  and  DMs. 
Pairs  of  two-view  DFM  and  DM  from  99  patients  were  used.  All  mammograms  were  manually 
segmented  by  5  experienced  breast  radiologists  using  a  graphical  user  interface  developed  in  this 
project.  The  mean  difference  in  the  segmented  density  between  DFMs  and  DMs  was  about  3.5% 
and the mean ratio was about 1.30, indicating that the mammographic density was about 30%
higher, on average, on the DFMs. The difference in the mammographic density may be
attributed to the harder beam quality used and the digital image processing applied to the DMs.
The lower density on DMs may improve the mammographic sensitivity for lesion detection on dense
breasts. However, for patients with DFMs and DMs taken over time, comparison of serial
mammograms for breast density changes will be problematic. (Publication A4)

(E)  Automated  Breast  Density  Analysis  System  for  Digital  Mammograms 

The  MDEST  system  was  originally  developed  using  DFMs.  To  adapt  the  system  to  DMs, 
we  incorporated  an  Expectation-Maximization  (EM)  algorithm  to  extract  gray  level  features 
from the histogram. A rule-based classifier was trained to estimate a gray level threshold for
segmenting  the  dense  area  from  the  breast  region  adaptively.  The  performance  of  the  MDEST 
system  for  DMs  was  improved  after  the  incorporation  of  the  EM  estimation  and  re-training.  The 
correlation  between  the  computer-estimated  percent  dense  area  and  the  radiologists’  manual 
segmentation  improved  from  0.85  and  0.87  to  0.94  and  0.92,  respectively,  for  CC  and  MLO 
views. The root-mean-square (RMS) errors improved from 7.3% and 5.7% to 4.2% and 4.4%,
respectively,  for  CC  and  MLO  views.  (Publications  P2,  A5,  A6,  A8) 
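
The report does not spell out the mixture model or which features were taken from the EM fit, so the following is only a sketch under the assumption that the gray levels inside the breast are modelled as a two-component one-dimensional Gaussian mixture. The function name, the initialisation, and the iteration count are illustrative choices.

```python
import numpy as np

def em_two_gaussians(values, n_iter=100):
    """Fit a two-component 1-D Gaussian mixture to gray levels with EM."""
    x = np.asarray(values, dtype=float)
    # Initialise the two means at the lower and upper quartiles (arbitrary choice).
    mu = np.array([np.percentile(x, 25), np.percentile(x, 75)])
    sigma = np.array([x.std(), x.std()]) + 1e-6
    weight = np.array([0.5, 0.5])
    for _ in range(n_iter):
        # E-step: responsibility of each component for each gray level.
        pdf = np.exp(-0.5 * ((x[:, None] - mu) / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))
        resp = weight * pdf
        resp /= np.maximum(resp.sum(axis=1, keepdims=True), 1e-300)
        # M-step: update weights, means, and standard deviations.
        nk = resp.sum(axis=0)
        weight = nk / x.size
        mu = (resp * x[:, None]).sum(axis=0) / nk
        sigma = np.sqrt((resp * (x[:, None] - mu) ** 2).sum(axis=0) / nk) + 1e-6
    return weight, mu, sigma

# Synthetic gray levels: a "fatty" mode near 50 and a "dense" mode near 150.
rng = np.random.default_rng(0)
gray = np.concatenate([rng.normal(50, 5, 500), rng.normal(150, 5, 500)])
weight, mu, sigma = em_two_gaussians(gray)
print(weight, mu, sigma)
```

The fitted means, widths, and weights are the kind of histogram features a rule-based classifier could then use to choose a threshold; how MDEST does this is described in the cited publications, not here.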

(F)  Breast  Segmentation:  Breast  Boundary  Detection  and  Pectoral  Muscle  Trimming 

Breast  density  estimation  depends  strongly  on  the  accuracy  of  breast  boundary 
segmentation  and,  for  the  MLO  view,  pectoral  muscle  segmentation.  We  developed  an  improved 
breast  boundary  tracking  method  using  Sobel  edge  detection  in  combination  with  multiple  edge 
thresholding to generate contour images and active contour for segmentation. For the detection
of  pectoral  boundary,  we  developed  a  method  that  included  gradient-based  directional  filtering, 
gradient-based  texture  analysis  to  generate  an  orientation  image,  mean  shift  smoothing,  and 
ridge-tracking. For a data set of 118 MLO view mammograms, the new method identified
99.2%  of  the  pectoral  muscles.  (Publications  P3,  A7) 
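
As an illustration of the first stage only, Sobel edge detection followed by thresholding at several levels can be sketched as below. The threshold fractions and function names are assumptions made for this example; the active contour, directional filtering, mean shift smoothing, and ridge-tracking stages are not shown.

```python
import numpy as np

def sobel_magnitude(image):
    """Gradient magnitude from the 3 x 3 Sobel operators (edge-replicated border)."""
    img = np.pad(np.asarray(image, dtype=float), 1, mode="edge")
    # Horizontal response: right column minus left column, weighted 1-2-1.
    gx = (img[:-2, 2:] + 2 * img[1:-1, 2:] + img[2:, 2:]
          - img[:-2, :-2] - 2 * img[1:-1, :-2] - img[2:, :-2])
    # Vertical response: bottom row minus top row, weighted 1-2-1.
    gy = (img[2:, :-2] + 2 * img[2:, 1:-1] + img[2:, 2:]
          - img[:-2, :-2] - 2 * img[:-2, 1:-1] - img[:-2, 2:])
    return np.hypot(gx, gy)

def edge_maps(image, fractions=(0.25, 0.5, 0.75)):
    """Binary edge maps at several thresholds, given as fractions of the peak gradient."""
    mag = sobel_magnitude(image)
    peak = mag.max()
    if peak == 0:
        return [np.zeros(mag.shape, dtype=bool) for _ in fractions]
    return [mag >= f * peak for f in fractions]

# Toy image with a vertical step edge between columns 2 and 3.
step = np.zeros((5, 6))
step[:, 3:] = 100
maps = edge_maps(step)
print(maps[0].astype(int))
```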

(G) Comparison of Mammographic Density by MDEST with Radiologists’ Estimates and BI-RADS Categories

The fully automated MDEST computer program was used to estimate breast density on
digitized mammograms from 65 women, ages 24-89 (mean = 53). Five breast radiologists
prospectively assigned qualitative BI-RADS density categories and visually estimated percent
density of 260 mammograms. Qualitative BI-RADS assessments were compared to new
quantitative BI-RADS standards. The reference standard density for this study was established
by allowing the 5 radiologists to manipulate the MDEST gray level threshold, which segmented
mammograms into dense and non-dense areas. It was found that there was close correlation
between the reference standard and radiologist-estimated density (correlation = 0.90-0.95) and
MDEST density (correlation = 0.89). MDEST had tighter agreement with the reference
standard, with an average overestimate of 1% (-15% to +18%). MDEST correlated better with
percent density than qualitative BI-RADS categories. There was large overlap and range of
percent density in qualitative BI-RADS categories 2 through 4. Qualitative BI-RADS categories
correlated poorly with the new quantitative BI-RADS categories. Six percent (16/260) of views
were erroneously classified by MDEST. This study indicated that MDEST compared favorably to
radiologist estimates of percent density, and is more reproducible than radiologist estimates
using qualitative BI-RADS density categories. Qualitative and quantitative BI-RADS density
assessments differed markedly. (Publication J3)

(6)  Key  Research  Accomplishments 

•  Collected over 670 cases of digitized film mammograms and digital mammograms for
development  of  the  automated  density  segmentation  program  (Task  1). 

•  Developed an automated image analysis tool, the Mammographic Density ESTimator (MDEST),
for estimation of the percentage of breast dense area on mammograms. (Task 2)

•  Evaluated the segmentation accuracy by comparison with radiologists’ manual segmentation
and  found  that  the  correlation  between  the  computer-estimated  percent  dense  area  and 
radiologists’  manual  segmentation  was  0.94  and  0.91,  respectively,  for  CC  and  MLO  views, 
with  a  mean  bias  of  less  than  2%.  (Task  2) 

•  Evaluated correlation between mammographic density and BI-RADS density ratings and
found that there was a close correlation between the reference standard and
radiologist-estimated density (R = 0.90-0.95) and MDEST density (R = 0.89). MDEST correlated
better with percent density than qualitative BI-RADS categories. (Task 2)

•  Evaluated correlation between mammographic density and volumetric density measurement
with  MR  breast  images  and  found  that  the  correlation  of  automated  MDEST  segmented 
percent  dense  area  of  the  CC  and  MLO  views  with  the  percent  volumetric  fibroglandular 
tissue  on  MR  images  was  0.91  and  0.89,  respectively.  The  mean  differences  between  the 
percent  dense  area  and  the  MR  percent  volumetric  density  ranged  from  3%  to  6%.  (Task  3) 

•  Evaluated the correlation between mammographic density estimated on DFMs and DMs and
found that the mean difference in the segmented density between DFMs and DMs was about
3.5% and the mean ratio was about 1.30, indicating that the mammographic density was
about 30% higher, on average, on the DFMs. (Tasks 2 and 4)

•  Adapted  the  MDEST  density  segmentation  system  to  digital  mammograms  successfully  and 
the  system  now  can  be  applied  to  both  modalities  (Task  4) 

•  Compared  automated  density  segmentation  results  on  DMs  with  radiologists’  manual 
segmentation  and  found  that  the  correlation  between  the  computer-estimated  percent  dense 
area  and  the  radiologists’  manual  segmentation  was  0.94  and  0.92,  respectively,  for  CC  and 
MLO  views.  The  root-mean-square  (RMS)  errors  were  4.2  and  4.4%,  respectively,  for  CC 
and  MLO  views  (Task  4). 

•  Developed  a  breast  boundary  detection  method  and  a  pectoral  muscle  trimming  method  to 
improve the accuracy of breast area estimation for calculation of percent dense area. For a
data  set  of  118  MLO  view  mammograms,  the  pectoral  muscle  trimming  method  identified 
99.2%  of  the  pectoral  muscles  (Task  2  and  Task  4). 
(7) Reportable Outcomes

As  a  result  of  the  support  by  the  USAMRMC  BCRP  grant,  we  have  developed  an 
automated  breast  density  analysis  system  for  both  DMs  and  DFMs.  We  have  also  developed 
graphical user interfaces that allow radiologists to perform interactive thresholding of breast
density on mammograms efficiently. The results of these investigations have been presented at
international conferences or published in peer-reviewed journals. The publications from this
project are listed
below.  Many  of  these  have  been  reported  in  the  annual  reports. 

Journal  Articles: 


J1. Zhou C, Chan HP, Petrick N, Helvie MA, Goodsitt MM, Sahiner B, Hadjiiski LM.
Computerized image analysis: Estimation of breast density on mammograms. Medical
Physics 2001;28:1056-1069.

J2.  Wei  J,  Chan  HP,  Helvie  MA,  Roubidoux  MA,  Sahiner  B,  Hadjiiski  L,  Zhou  C, 
Paquerault  S,  Chenevert  T,  Goodsitt  MM.  Correlation  between  Mammographic  Density 
and  Volumetric  Fibroglandular  Tissue  Estimated  on  Breast  MR  Images.  Medical  Physics 
2004;31:933-942. 

J3.  Martin  KE,  Helvie  MA,  Zhou  C,  Roubidoux  MA,  Bailey  JE,  Paramagul  C,  Blanc  CE, 
Klein  K,  Sonnad  S,  Chan  HP.  Mammographic  density  measured  by  an  automatic 
computer-aided quantitative method: Comparison with radiologists’ estimates and BI-RADS
categories. Radiology 2006;240:656-665.


Conference  Proceeding: 

P1. Chan HP, Hadjiiski LM, Roubidoux MA, Helvie MA, Paquerault S, Sahiner B,
Chenevert  T,  Goodsitt  MM.  Breast  density  estimation:  correlation  of  mammographic 
density  and  MR  volumetric  density.  In:  Digital  Mammography  IWDM  2002:  6th 
International  Workshop  on  Digital  Mammography.  Ed.  Peitgen  HO.  (Springer,  Berlin) 
2003:281-284. 

P2.  Zhou  C,  Chan  HP,  Wei  J,  Helvie  MA,  Roubidoux  MA,  Paramagul  C,  Nees  A,  Hadjiiski 
LM,  Sahiner  B.  Performance  evaluation  of  an  automated  breast  density  estimation  system 
for  digital  mammograms  and  digitized  film  mammograms.  In:  Digital  Mammography 
IWDM 2004: 7th International Workshop on Digital Mammography. Ed. Pisano E. 425-429.

P3.  Zhou  C,  Hadjiiski  LM,  Paramagul  C,  Sahiner  B,  Chan  HP,  Wei  J.  Computerized  pectoral 
muscle identification on MLO-view mammograms for CAD applications. Proc SPIE
2005;5747:852-857.


Conference  Abstracts  and  Presentations: 
A1. Chan HP, Hadjiiski LM, Roubidoux MA, Helvie MA, Paquerault S, Sahiner B,
(8) Conclusions

We  have  developed  an  automated  mammographic  density  segmentation  system,  referred 
to as Mammographic Density ESTimator (MDEST), for both DMs and DFMs. Our studies
showed  that  the  automated  MDEST  system  can  provide  percent  dense  area  estimates  that  are 
highly  correlated  with  radiologists’  interactive  thresholding  results  and  the  percent  volumetric 
fibroglandular  tissue  estimates  from  MR  breast  images.  The  quantitative  estimates  are  superior 
to  the  radiologists’  qualitative  BI-RADS  density  assessment.  The  MDEST  system  can  provide  a 
consistent  and  reproducible  estimation  of  percent  dense  area  on  routine  clinical  mammograms. 
This  will  facilitate  studies  of  various  factors  associated  with  breast  cancer  risk  and 
mammographic  sensitivity,  and  monitoring  the  effects  of  interventional  or  preventive  strategies. 
The  image  analysis  tool  will  therefore  contribute  to  the  understanding  of  the  relationship  of 
density  to  breast  cancer  risk,  detection,  prognosis,  and  to  the  prevention  and  treatment  of  breast 
cancers. 
Computerized image analysis: Estimation of breast density
on mammograms

Chuan Zhou, Heang-Ping Chan, Nicholas Petrick, Mark A. Helvie, Mitchell M. Goodsitt,
Berkman  Sahiner,  and  Lubomir  M.  Hadjiiski 

Department  of  Radiology,  The  University  of  Michigan,  Ann  Arbor,  Michigan  48109-0030 
An automated image analysis tool is being developed for the estimation of mammographic breast
density.  This  tool  may  be  useful  for  risk  estimation  or  for  monitoring  breast  density  change  in 
prevention  or  intervention  programs.  In  this  preliminary  study,  a  data  set  of  4-view  mammograms 
from  65  patients  was  used  to  evaluate  our  approach.  Breast  density  analysis  was  performed  on  the 
digitized  mammograms  in  three  stages.  First,  the  breast  region  was  segmented  from  the  surrounding 
background  by  an  automated  breast  boundary-tracking  algorithm.  Second,  an  adaptive  dynamic 
range  compression  technique  was  applied  to  the  breast  image  to  reduce  the  range  of  the  gray  level 
distribution  in  the  low  frequency  background  and  to  enhance  the  differences  in  the  characteristic 
features  of  the  gray  level  histogram  for  breasts  of  different  densities.  Third,  rule-based  classification 
was  used  to  classify  the  breast  images  into  four  classes  according  to  the  characteristic  features  of 
their  gray  level  histogram.  For  each  image,  a  gray  level  threshold  was  automatically  determined  to 
segment  the  dense  tissue  from  the  breast  region.  The  area  of  segmented  dense  tissue  as  a  percentage 
of  the  breast  area  was  then  estimated.  To  evaluate  the  performance  of  the  algorithm,  the  computer 
segmentation  results  were  compared  to  manual  segmentation  with  interactive  thresholding  by  five 
radiologists.  A  “true”  percent  dense  area  for  each  mammogram  was  obtained  by  averaging  the 
manually segmented areas of the radiologists. We found that the histograms of 6% (8 CC and 8
MLO views) of the breast regions were misclassified by the computer, resulting in poor
segmentation of the dense region. For the images with correct classification, the correlation between the
computer-estimated  percent  dense  area  and  the  “truth”  was  0.94  and  0.91,  respectively,  for  CC  and 
MLO  views,  with  a  mean  bias  of  less  than  2%.  The  mean  biases  of  the  five  radiologists’  visual 
estimates  for  the  same  images  ranged  from  0.1%  to  11%.  The  results  demonstrate  the  feasibility  of 
estimating  mammographic  breast  density  using  computer  vision  techniques  and  its  potential  to 
improve  the  accuracy  and  reproducibility  of  breast  density  estimation  in  comparison  with  the 
subjective visual assessment by radiologists. © 2001 American Association of Physicists in
Medicine. [DOI: 10.1118/1.1376640]

Key words: mammography, computer-aided diagnosis, breast density, breast cancer risk, image
segmentation,  thresholding 


I.  INTRODUCTION 

Breast cancer is one of the leading causes of cancer mortality among women. One in every
eight women will develop breast cancer at some point in their lives. The most successful
method for the early detection of breast cancer is screening mammography. Currently,
mammograms are analyzed visually by radiologists. Because of the subjective nature of
visual analysis, qualitative responses may vary from radiologist to radiologist. Therefore, a
computerized method for analyzing mammographic features would be useful as a supplement
to the radiologist’s assessment. Previous research efforts in computer-aided diagnosis (CAD)
for breast cancer detection mainly concentrated on detection and characterization of masses
and microcalcifications on mammograms by using computer vision techniques. It has been
demonstrated that an effective CAD algorithm can improve the diagnostic accuracy of breast
cancer characterization on mammograms, which, in turn, may reduce unnecessary biopsies.
In this work, we are studying the feasibility of developing a CAD system for an analysis of
breast density on mammograms. Studies have shown that there is a strong positive
correlation between breast parenchymal density on mammograms and breast cancer risk.
The relative risk is estimated to be about 4 to 6 times higher for women whose mammograms
have parenchymal densities over 60% of the breast area, as compared to women with less
than 5% of parenchymal densities.

An important difference between breast density as a risk factor and most other risk factors
is the fact that breast tissue density can be changed by dietary or hormonal interventions.
Although there is no direct evidence that changes in mammographic breast densities will lead
to changes in breast cancer risk, the strong correlation between breast density and breast
cancer risk has prompted researchers to use mammographic density as an indicator for
monitoring the effects of intervention as well as for studying breast cancer etiology.

Different methods have been used for the evaluation of mammographic breast density.
Earlier studies used a subjective visual assessment of the breast parenchyma primarily based
on the four patterns described by Wolfe (N1 is comprised entirely of fat; P1 has up to 25%
nodular densities; P2 has over 25% nodular mammographic densities; DY contains extensive
regions of homogeneous mammographic densities). The subjectivity in classifying the
mammographic patterns introduced large variability in the risk estimation. Later studies
used more quantitative estimates, such as planimetry, to measure the dense area in the
breast manually outlined by radiologists on mammograms. These studies indicate that the
percentage (%) of mammographic densities relative to the breast area can predict the breast
cancer risk more accurately than a qualitative assessment of mammographic patterns.
Warner et al. conducted a meta-analysis of the studies published between 1976 and 1990 to
investigate the effect of different methods of classification on estimates of cancer risk. They
found that the mammographic parenchymal pattern does correlate with the breast cancer
risk. The magnitude of the risk varies according to the method used to evaluate the
mammograms. With the quantitative estimates of mammographic density, the difference in
risk between the highest and the lowest risk category is substantial and is greater than the
risks associated with most other risk factors for breast cancer. More recent studies used
fractal texture and the shape of the gray level histogram to quantify the parenchymal
pattern or used interactive thresholding on digitized mammograms to segment the dense
area. It was reported that the thresholding method provided a higher risk value than the
texture measure or the histogram shape. Other researchers have attempted to calculate a
breast density index to model the radiologists’ perception.

In clinical practice, radiologists routinely estimate the breast density on mammograms by
using the BI-RADS lexicon as recommended by the American College of Radiology in order
to provide a reference for mammographic sensitivity. Because of the lack of a quantitative
method for breast density estimation, researchers often use the BI-RADS rating for
monitoring responses to preventive or interventional treatment and the associated changes
in breast cancer risk. We have found that there is a large interobserver variability in the
BI-RADS ratings among experienced mammographers. An automated and quantitative
estimation, as investigated in this study, will provide not only an efficient means to measure
mammographic density, but also a reproducible estimate that will reduce the inter- and
intraobserver variability of mammographic density measurements. This image analysis tool
will therefore allow researchers to study more definitively the relationship of mammographic
density to breast cancer risk, detection, prognosis, and mammographic sensitivity, and to
better monitor the response of a patient to preventive or interventional treatment of breast
cancers.

In this paper, we will describe the image processing techniques used in our automated
breast density segmentation algorithm. The performance of the computer segmentation was
evaluated by a comparison with the average segmentation by 5 radiologists using interactive
thresholding in the same data set.

II.  MATERIALS  AND  METHODS 

A.  Database 

A data set consisting of 260 mammograms of 65 patients was used for the development of
the histogram analysis method in this study. Each case contains the craniocaudal (CC) view
and the mediolateral oblique (MLO) view of both breasts of the patient. The first 50
mammograms were consecutive screening cases from the patient files in the Radiology
Department at the University of Michigan. After data analysis, it was found that there were
very few dense breasts in the initial data set. An additional 15 cases visually judged by
radiologists to be dense breasts were then randomly selected and mixed with the initial set.
The images were processed individually without knowing their BI-RADS categories. The
mammograms were acquired with mammography systems approved by the Mammography
Quality Standards Act (MQSA) and were digitized with a LUMISYS 85 laser film scanner
with a pixel size of 50 μm × 50 μm and 4096 gray levels. The gray levels are linearly
proportional to optical densities (O.D.) from 0.1 to greater than 3 O.D. units. The nominal
O.D. range of the scanner is 0-4 with large pixel values in the digitized mammograms
corresponding to low O.D. The full resolution mammograms were first smoothed with a
16 × 16 box filter and subsampled by a factor of 16, resulting in 800 μm × 800 μm images of
approximately 225 × 300 pixels in size for small films and 300 × 375 pixels for large films.
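
The smoothing and subsampling step can be illustrated with a short sketch. It assumes that the 16 × 16 box filter windows are aligned with the subsampling grid, in which case the two operations reduce to averaging non-overlapping 16 × 16 blocks; the function name and the cropping of incomplete border blocks are our choices.

```python
import numpy as np

def block_average(image, factor=16):
    """Average non-overlapping factor x factor blocks (box filter plus subsampling)."""
    img = np.asarray(image, dtype=float)
    rows = (img.shape[0] // factor) * factor
    cols = (img.shape[1] // factor) * factor
    img = img[:rows, :cols]  # drop incomplete blocks at the bottom and right borders
    return img.reshape(rows // factor, factor, cols // factor, factor).mean(axis=(1, 3))

# A 50 um pixel averaged over 16 x 16 blocks gives an 800 um pixel.
full_res = np.arange(32 * 48, dtype=float).reshape(32, 48)
small = block_average(full_res)
print(small.shape, 50 * 16)
```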

B.  Breast  segmentation  and  image  enhancement 

The  breast  image  is  first  segmented  from  the  surrounding 
image  background  by  boundary  detection.  The  detected 
boundary  separated  the  breast  from  other  background  fea¬ 
tures  such  as  the  directly  exposed  area,  patient  identification 
information,  and  lead  markers.  The  density  analysis  was  per¬ 
formed  only  within  the  breast  region.  An  automated  breast 
boundary  tracking  technique  developed  previously^^’^®  was 
modified  to  improve  its  performance.  Briefly,  the  technique 
used  a  gradient-based  method  to  search  for  the  breast  bound¬ 
ary.  The  background  of  the  image  was  estimated  initially  by 
searching  for  the  largest  background  peak  from  the  gray 
level  histogram  of  the  image.  After  subtracting  this  back¬ 
ground  level  from  the  breast  region,  a  simple  edge  was  found 
by  a  line-by-line  gradient  analysis  from  the  top  to  the  bottom 
of  the  image.  The  criterion  used  in  detecting  the  edge  points 
was  the  steepness  of  the  gradient  of  four  adjacent  pixels 
along  the  horizontal  direction.  The  steeper  the  gradient,  the 
greater  the  likelihood  that  an  edge  existed  at  that  correspond¬ 
ing  image  point.  The  simple  edge  served  as  a  starting  point 
for  a  more  accurate  tracking  algorithm  that  followed.  The 
tracking  of  the  breast  boundary  started  from  approximately 
the  middle  of  the  breast  image  and  moved  upward  and  down¬ 
ward  along  the  boundary.  The  direction  to  search  for  a  new 
edge  point  was  guided  by  the  previous  edge  points.  The  edge 
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Fig.  1.  (a)  A  mammogram  from  our 
image  database;  (b)  the  image  super¬ 
imposed  with  the  detected  breast 
boundary  and  pectoral  muscle  bound¬ 
ary;  (c)  the  binary  map  of  the  seg¬ 
mented  breast  region. 


location  was  again  determined  by  searching  for  the  maxi¬ 
mum  gradient  along  the  gray  level  profile  normal  to  the 
tracking  direction.  Since  the  boundary  tracking  was  guided 
by  the  simple  edge  and  the  previously  detected  edge  points, 
it  could  steer  around  the  breast  boundary  and  was  less  prone 
to  diversion  by  noise  and  artifacts.  The  accuracy  of  the 
boundary  tracking  technique  was  evaluated  in  our  previous 
study^^ by quantifying the root-mean-square differences between
the detected and manually identified breast boundaries. In the
current study, the performance of the boundary tracking technique
for this data set was determined by superimposing the detected
boundary on the breast image and visually judging whether the
detected boundary coincided with the perceived breast boundary.
The breast image and its boundary were displayed with the contrast
and brightness appropriately adjusted. Incomplete, jagged, and
mistracked boundaries were considered incorrect tracking.
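
The initial "simple edge" step described above can be sketched as follows. This is only an illustration under stated assumptions: the background level is taken as the most frequent gray level, the "gradient of four adjacent pixels" is read as the gray level difference across a four-pixel window, and the names `background_level` and `simple_edge` are hypothetical. The subsequent boundary tracking is not shown.

```python
def background_level(image, levels):
    """Most frequent gray level, taken here as the background peak (assumption)."""
    hist = [0] * levels
    for row in image:
        for v in row:
            hist[v] += 1
    return max(range(levels), key=lambda g: hist[g])


def simple_edge(image, levels=4096, span=4):
    """For each image row, return the column where the steepest gradient starts."""
    bg = background_level(image, levels)
    edge = []
    for row in image:
        # Subtract the background level, clipping negative values to zero.
        r = [max(v - bg, 0) for v in row]
        # Gradient measured across `span` adjacent pixels (assumed reading).
        best = max(range(len(r) - span + 1),
                   key=lambda x: abs(r[x + span - 1] - r[x]))
        edge.append(best)
    return edge
```

Each returned column is only a starting estimate; in the text it seeds the more accurate tracking step rather than serving as the final boundary.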

The  unexposed  film  area  around  the  film  edges  was  de¬ 
tected  automatically.  After  the  breast  boundary  was  found,  a 
region  growing  algorithm  was  used  to  fill  the  enclosed  breast 
region.  The  result  was  a  binary  map  that  distinguished  the 
breast  region  from  the  background  areas.  An  example  of  the 
tracked  breast  boundary  and  the  breast  binary  map  is  shown 
in Figs. 1(a)-1(c).

For the MLO view mammograms, an additional step had
to be performed to segment the pectoral muscle. The
initial  edge  in  the  pectoral  region  was  found  as  the  maximum 
gradient  point  by  a  line-by-line  gradient  analysis  from  the 
chest  wall  to  the  breast  boundary.  The  false  pectoral  muscle 
edge  points  were  discarded  by  an  edge  validation  process. 
First,  a  straight  line  was  fitted  to  the  initial  edge  points,  and 
the  points  that  did  not  lie  close  to  the  fitted  line  were  re¬ 
moved.  Second,  the  remaining  edge  points  that  were  con¬ 
nected  were  identified  by  an  8-connectivity  criterion.  An 
edge  segment  was  removed  if  its  direction  was  inconsistent 
with  the  pectoral  edge  direction  relative  to  the  breast  image. 
Finally,  a  second  order  curve  was  fitted  to  the  remaining 
edge  points  to  separate  the  pectoral  muscle  from  the  breast 
region.  The  pixels  in  the  pectoral  muscle  region  were  ex¬ 
cluded  from  the  histogram  analysis  and  breast  area  calcula¬ 
tion.  The  accuracy  of  the  pectoral  muscle  detection  was  also 
judged  visually  in  this  study,  similar  to  the  method  used  for 
the  breast  boundary  described  above.  Figure  1  shows  the 
pectoral  muscle  trimming  result  for  an  MLO  view  mammo¬ 
gram. 
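
A minimal sketch of the edge validation and curve fitting steps is given below, assuming edge candidates are stored as (row, column) pairs. The distance tolerance `tol`, the helper names, and the use of ordinary least squares are assumptions for illustration; the 8-connectivity grouping and the direction test described above are omitted.

```python
def polyfit(xs, ys, deg):
    """Least-squares polynomial fit; returns c with y = c[0] + c[1]*x + ..."""
    n = deg + 1
    # Build the normal equations A c = b.
    A = [[sum(x ** (i + j) for x in xs) for j in range(n)] for i in range(n)]
    b = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(n)]
    for col in range(n):  # Gauss-Jordan elimination with partial pivoting
        piv = max(range(col, n), key=lambda r: abs(A[r][col]))
        A[col], A[piv] = A[piv], A[col]
        b[col], b[piv] = b[piv], b[col]
        for r in range(n):
            if r != col:
                f = A[r][col] / A[col][col]
                A[r] = [a - f * p for a, p in zip(A[r], A[col])]
                b[r] -= f * b[col]
    return [b[i] / A[i][i] for i in range(n)]


def polyval(c, x):
    return sum(ck * x ** k for k, ck in enumerate(c))


def validate_and_fit(points, tol=3.0):
    """Drop points far from a fitted straight line, then fit a quadratic."""
    rows = [r for r, _ in points]
    cols = [c for _, c in points]
    line = polyfit(rows, cols, 1)
    kept = [(r, c) for r, c in points if abs(c - polyval(line, r)) <= tol]
    quad = polyfit([r for r, _ in kept], [c for _, c in kept], 2)
    return quad, kept
```

The quadratic coefficients describe the column of the pectoral edge as a function of row; pixels on the chest-wall side of that curve would then be excluded from the breast region.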


To  facilitate  histogram  analysis,  a  dynamic  range  com¬ 
pression  method  was  developed  to  reduce  the  gray  level 
range  of  the  histograms.  With  our  digitization,  the  gray  lev¬ 
els  of  the  dense  tissue  are  higher  than  those  of  the  adipose 
tissue.  Because  of  variations  in  exposure  condition  and 
breast  thickness  near  the  periphery,  the  gray  level  distribu¬ 
tion  corresponding  to  the  breast  parenchymal  pattern  is  su¬ 
perimposed  on  a  low  frequency  background  that  mainly  rep¬ 
resents  the  global  variations  in  exposure.  This  low  frequency 
background  distorts  the  characteristic  features  of  the  histo¬ 
gram  due  to  the  density  pattern.  To  reduce  the  distortion,  an 
adaptive  dynamic  range  compression  technique  was  applied 
to the breast image. For a given breast image, $F(x,y)$, which
contains low frequency background and higher frequency breast
tissue structures, a smoothed image, $F_B(x,y)$, was obtained by
applying a large-scale box filter to $F(x,y)$ to remove the high
frequency components while retaining the low frequency components.
The image $F_B(x,y)$ was then compressed by a scale factor $k$:

$$F_C(x,y) = k\,F_B(x,y). \tag{1}$$

To reconstruct the high frequency components, $F_C(x,y)$ was
subtracted from a constant gray level $G$, and the result was added
to the original image, $F(x,y)$:

$$F_D(x,y) = G - F_C(x,y), \tag{2}$$

$$F_E(x,y) = F_D(x,y) + F(x,y). \tag{3}$$

Equivalently, $F_E = F + G - kF_B$, so the low frequency background
is attenuated by the factor $(1-k)$ while the high frequency detail
is left unchanged. Histogram analysis was applied to the
dynamic-range-compressed image $F_E(x,y)$. Figure 2 shows an example
of the resulting images and gray level histograms obtained from
this procedure, where the size of the box filter is 35×35, the
scale factor $k$ is 0.5, and the constant gray level $G$ is the
maximum gray level of the compressed image $F_C(x,y)$. The values
of these parameters were chosen experimentally as a balance between
reducing the dynamic range and preserving the image features in the
compressed image.
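
The three steps of Eqs. (1)-(3) can be sketched in a few lines. The version below is an illustration only: it uses plain nested lists, a box filter whose window is clipped at the image border (the border handling is an assumption, as the text does not specify it), and hypothetical function names.

```python
def box_filter(img, size):
    """Mean filter with a size x size window, clipped at the image border."""
    h, w, r = len(img), len(img[0]), size // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            win = [img[j][i]
                   for j in range(max(0, y - r), min(h, y + r + 1))
                   for i in range(max(0, x - r), min(w, x + r + 1))]
            out[y][x] = sum(win) / len(win)
    return out


def compress_dynamic_range(img, size=35, k=0.5):
    smooth = box_filter(img, size)                        # low frequency background
    compressed = [[k * v for v in row] for row in smooth]  # Eq. (1)
    g = max(max(row) for row in compressed)                # constant gray level G
    inverted = [[g - v for v in row] for row in compressed]  # Eq. (2)
    # Eq. (3): add the inverted background back to the original image.
    return [[d + f for d, f in zip(rd, rf)] for rd, rf in zip(inverted, img)]
```

With k = 0.5, two flat regions that differ by 100 gray levels in the original end up 50 gray levels apart, while local detail within each region is unchanged.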

C.  Breast  density  segmentation  and  estimation 

A  rule-based  threshold  technique  was  developed  to  seg¬ 
ment  the  dense  areas  from  the  breast  background.  The  histo¬ 
gram  of  the  breast  region  on  the  dynamic-range-compressed 
mammogram  was  generated  and  smoothed.  The  histograms 
of  these  images  in  the  database  were  analyzed  to  formulate 
an  automatic  thresholding  routine.  The  histograms  were 
Fig. 2. (a) A typical mammogram from our image database; (b) the low frequency image $F_B(x,y)$ obtained by a 35×35 box filter; (c) the compressed image
$F_C(x,y)$; (d) the inverted image $F_D(x,y)$; (e) the enhanced image $F_E(x,y)$; (f) the gray level histogram within the breast region of the original image $F(x,y)$;
and (g) the gray level histogram of the breast region of the enhanced image $F_E(x,y)$.


grouped into four classes based on the characteristic shapes
of their histograms. It was observed that the grouping
corresponded approximately to the four BI-RADS breast density
ratings: Class I corresponded to breasts of almost entirely fat,
Class II corresponded to scattered fibroglandular densities,
Class III corresponded to heterogeneously dense breasts, and
Class IV corresponded to extremely dense breasts. Examples of
typical histograms for these four classes are shown in Fig. 3.
The histograms seemed to follow two basic patterns. In one
pattern, there was only one dominant peak, which represented
most of the breast structures in the breast region. In the other
pattern, in addition to a large peak in the histogram, there
were one or two smaller peaks on the right or left side of the
large peak. In a majority of the cases, the smaller peak was
distinguishable from the large one when the random fluctuation
on the histogram was smoothed.


1.  Peak  detection  and  feature  description 

The gray level histogram within the breast area was generated
and normalized, and passed through an averaging window to smooth
out the random fluctuations. We estimated the window size to be
in the range of 30 to 50 gray levels by experimentally evaluating
the histogram shapes and density segmentation at different window
sizes. Too small a window cannot smooth out the fluctuations, and
too large a window will blur the useful features. A window size
of 30 was used in this study. The second derivative at every point
on the histogram curve was computed. An example of the histogram
and its second derivative curve is shown in Fig. 4. The zero
crossing locations were detected by scanning for the
positive-to-negative and negative-to-positive changes on the
latter curve. If the second derivative was negative between two
zero crossing points, it indicated that a peak existed between
these two points on the histogram. Normally, as shown in Fig. 4,
a peak included the peak point $P_0$ and two valley points $P_1$
and $P_2$ located on the two sides of the peak point. The peak
point $P_0$ was determined by searching for the maximum histogram
value between the zero crossing points $Z_2$ and $Z_3$, and the
$P_1$ and $P_2$ points were obtained by searching for the point
with minimum histogram value between zero crossing points
$Z_1$, $Z_2$ and $Z_3$, $Z_4$, respectively.

Fig. 3. Four typical classes of histograms and the setting of gray level interval $[g_1, g_2]$ for the threshold calculation.
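
The peak search described above can be sketched as follows. This is an illustrative reading, not the exact implementation: the moving average is truncated at the ends of the histogram, the second derivative is approximated by a three-point finite difference, and only the peak points are returned (the valley points would be found analogously as minima in the adjacent positive-curvature intervals). The function names are hypothetical.

```python
def smooth(hist, window):
    """Moving average; the window is truncated at the histogram ends."""
    half = window // 2
    out = []
    for i in range(len(hist)):
        seg = hist[max(0, i - half): i + half + 1]
        out.append(sum(seg) / len(seg))
    return out


def second_derivative(h):
    """Three-point finite-difference approximation, zero at both ends."""
    inner = [h[i - 1] - 2 * h[i] + h[i + 1] for i in range(1, len(h) - 1)]
    return [0.0] + inner + [0.0]


def find_peaks(hist, window=30):
    """Return the smoothed histogram and the gray levels of detected peaks."""
    h = smooth(hist, window)
    d2 = second_derivative(h)
    peaks, start = [], None
    for i, v in enumerate(d2):
        if v < 0 and start is None:
            start = i  # positive-to-negative zero crossing
        elif v >= 0 and start is not None:
            # Negative-to-positive crossing: the peak is the maximum of the
            # smoothed histogram inside the negative-curvature interval.
            peaks.append(max(range(start, i), key=lambda g: h[g]))
            start = None
    return h, peaks
```

On noisy histograms, a window that is too small leaves many spurious negative-curvature intervals, which is the practical reason for the 30 to 50 gray level range mentioned in the text.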

The following peak features can be defined by the peak point
$P_0$ and the valley points $P_1$ and $P_2$:

Energy:

$$E = \frac{1}{A} \sum_{i=P_1}^{P_2} f(i)\,f(i), \tag{4}$$

left-side energy:

$$E_L = \frac{1}{A} \sum_{i=P_1}^{P_0} f(i)\,f(i), \tag{5}$$

right-side energy:

$$E_R = \frac{1}{A} \sum_{i=P_0}^{P_2} f(i)\,f(i), \tag{6}$$

likelihood:

$$L = E/E', \tag{7}$$

where $f(\cdot)$ is the histogram, $A$ is the total energy of the
entire histogram, $A = \sum_{i=0}^{N} f(i)\,f(i)$, and $N$ is the
maximum gray level of the histogram. $E'$ is the energy calculated
by approximating the histogram in the interval $[P_1, P_2]$ using
two straight lines, $P_1P_0$ and $P_0P_2$. The energy $E$ of the
peak is used to compare the sizes of the peaks on the histogram;
higher energy means a bigger peak. $E_L$ and $E_R$ split the
energy $E$ into two parts at the peak point for calculating the
ratio of the energy in these two parts. The likelihood $L$
describes how close the real peak is to the triangle represented
by the three points $P_0$, $P_1$, and $P_2$.

Fig. 4. The gray level histogram (solid curve) and the second derivative
(dot) curve, plotted against gray level. $P_0$ is the peak point, $P_1$ and $P_2$ are the valley points of the peak
on the two sides of the peak point $P_0$. Points $Z_1$, $Z_2$, $Z_3$, and $Z_4$ are zero
crossing points on the second derivative curve, which are used for searching
the points $P_0$, $P_1$, and $P_2$.
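
As a concrete illustration of the peak features, the sketch below computes the energy, the left-side and right-side energies, and the likelihood for one peak. It assumes that the two-line approximation passes through the histogram values at the valley and peak points and that its energy is normalized by the same total energy; both are assumptions, and the function name is hypothetical.

```python
def peak_features(f, p1, p0, p2):
    """Return (E, E_L, E_R, L) for a peak with valleys p1 < p0 < p2."""
    a = sum(v * v for v in f)  # total energy of the entire histogram

    def energy(lo, hi):
        return sum(f[i] * f[i] for i in range(lo, hi + 1)) / a

    def on_line(i, ia, ib):
        # Straight line through (ia, f[ia]) and (ib, f[ib]).
        return f[ia] + (f[ib] - f[ia]) * (i - ia) / (ib - ia)

    e, e_left, e_right = energy(p1, p2), energy(p1, p0), energy(p0, p2)
    # Two-line (triangle) approximation of the peak over [p1, p2].
    tri = ([on_line(i, p1, p0) for i in range(p1, p0)] +
           [on_line(i, p0, p2) for i in range(p0, p2 + 1)])
    e_prime = sum(v * v for v in tri) / a
    return e, e_left, e_right, e / e_prime
```

For a perfectly triangular peak the approximation is exact and the likelihood equals 1; note that with these summation limits the peak point contributes to both the left-side and the right-side energy.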

2.  Rule-based  histogram  classification 

A rule-based histogram classifier was developed to classify
the gray level histogram of the breast area into four classes.
As shown in Fig. 3, a typical Class I breast is almost entirely
fat and has a single narrow peak on the histogram. Class II has
scattered fibroglandular densities; apart from the tail on the
left, its histogram has two peaks, with the smaller peak on the
right of the bigger one. Class III is heterogeneously dense; it
also has two peaks, but the smaller peak is on the left of the
bigger one. Class IV is extremely dense; it has a single dominant
peak on the histogram, but the peak is wider than that in the
Class I histogram, and a second small peak sometimes occurs to
the left of the main peak.

The classification is performed in two steps. In the first
step, the computer determines whether there is only a single
peak in the histogram. The biggest peak (main peak) P and its
location are detected by comparing the energy of the peaks on
the histogram. The single peak feature is mainly determined by
the energy $E$ under the main peak and the features $E_L$ and
$E_R$. If the histogram is found to have a single-peak pattern,
in general, a narrow peak corresponds to a very fatty breast
(Class I), and a wider peak corresponds to a very dense breast
(Class IV). However, in some cases, the histograms of these two
classes are very similar, as discussed below (Fig. 9), and it is
difficult to distinguish them by their gray level histogram
distributions. Two additional image features were analyzed to
classify very fatty and very dense breasts. One feature is the
gray level standard deviation (Std) in the entire breast area,
defined as

$$\mathrm{Std} = \left( \frac{1}{N} \sum_{x \in \mathrm{MAP}} \sum_{y \in \mathrm{MAP}} \bigl( f(x,y) - \bar{f}(x,y) \bigr)^{2} \right)^{1/2}, \tag{8}$$

where MAP is the breast binary map region, $N$ is the number of
pixels within MAP, and $\bar{f}$ denotes the mean gray level over
MAP. Another feature is the number of single pixels and single
pixel-size holes (NSH) counted in the breast area of a segmented
binary image using the biggest histogram peak point as a
threshold. For a very fatty mammogram, the breast mainly consists
of a fatty background with some fibrous structures and
fibroglandular tissue scattered in the breast area. The NSH value
was found to be larger (greater than 50 pixels on average), and
Std smaller (less than 500 on average), compared with a
mammogram of a very dense breast.
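
The two additional features can be sketched as below. The standard deviation follows the usual definition over the pixels inside the breast map. For NSH, the text does not state the neighborhood used, so this sketch assumes that a "single pixel" or "single pixel-size hole" is a pixel whose binary value differs from all eight of its neighbors; that definition and the function names are assumptions.

```python
def masked_std(img, mask):
    """Gray level standard deviation over pixels where mask is nonzero."""
    vals = [v for ri, rm in zip(img, mask) for v, m in zip(ri, rm) if m]
    mean = sum(vals) / len(vals)
    return (sum((v - mean) ** 2 for v in vals) / len(vals)) ** 0.5


def count_single_pixels_and_holes(img, mask, threshold):
    """Count isolated pixels and one-pixel holes in the thresholded image."""
    h, w = len(img), len(img[0])
    binary = [[1 if img[y][x] > threshold else 0 for x in range(w)]
              for y in range(h)]
    count = 0
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            if not mask[y][x]:
                continue
            neighbours = [binary[y + dy][x + dx]
                          for dy in (-1, 0, 1) for dx in (-1, 0, 1)
                          if (dy, dx) != (0, 0)]
            # Isolated pixel (1 among 0s) or one-pixel hole (0 among 1s).
            if all(n != binary[y][x] for n in neighbours):
                count += 1
    return count
```

Border pixels are skipped here for simplicity, since they do not have a full eight-pixel neighborhood.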

In the second step, if the histogram is found to have more
than one peak, decision rules are used to decide whether the
second major peak is on the left side or on the right side of
the main peak, based on the features $E$, $E_L$, $E_R$, and $L$,
and the relative position of the two peaks. If the second major
peak is on the right, then the histogram is classified as
Class II; otherwise, it is classified as Class III.


3.  Gray  level  thresholding 

Gray  level  thresholding  is  essentially  a  pixel  classification 
problem.  Its  objective  is  to  classify  the  pixels  of  a  given 
image  into  two  classes:  one  includes  pixels  with  gray  values 
that are below or equal to a certain threshold; the other
includes those with gray values above the threshold.
Thresholding is a popular tool for image segmentation, and a
variety of techniques have been proposed over the years. In our study,
two  threshold  selection  methods  are  used:  one  is  the  Dis¬ 
criminant  Analysis  (DA)  method^"^  and  the  other  is  the  Maxi¬ 
mum  Entropy  Principle  (MEP)  based  method. The  DA 
method  assumes  that  the  image  gray  levels  can  be  classified 
into  two  classes  by  a  threshold.  To  estimate  the  threshold,  a 
discriminant  criterion  based  on  the  within-class  variance  and 
between-class  variance  is  introduced.  An  optimal  threshold 
is  selected  by  the  discriminant  criterion  to  maximize  the 
separability  of  the  resultant  classes  in  terms  of  gray  levels. 
This  method  is  well-suited  for  the  cases  where  the  gray  level 
histogram is bimodal. In an ideal situation, the histogram has
a deep and sharp valley between the two peaks representing
objects and background, respectively, and the optimum threshold
corresponds to the gray level at the bottom of this valley. A
more detailed description of the DA method can be found in
Appendix A.
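
A threshold of this kind can be sketched by exhaustively testing every gray level in an interval and keeping the one that maximizes the between-class variance, which is equivalent to maximizing the ratio of between-class to total variance (the criterion commonly associated with Otsu's method). The code below is an illustration under that reading, not the implementation of Appendix A; the function name and the interval arguments are hypothetical.

```python
def da_threshold(hist, g1=0, g2=None):
    """Threshold in [g1, g2] maximizing the between-class variance."""
    if g2 is None:
        g2 = len(hist) - 1
    levels = range(g1, g2 + 1)
    total = sum(hist[g] for g in levels)
    total_sum = sum(g * hist[g] for g in levels)
    best_t, best_var = g1, -1.0
    w0 = s0 = 0.0
    for t in levels:
        w0 += hist[t]        # weight of the class with gray level <= t
        s0 += t * hist[t]
        w1 = total - w0      # weight of the class with gray level > t
        if w0 == 0 or w1 == 0:
            continue
        m0, m1 = s0 / w0, (total_sum - s0) / w1
        var_between = w0 * w1 * (m0 - m1) ** 2  # unnormalized
        if var_between > best_var:
            best_t, best_var = t, var_between
    return best_t
```

Restricting the search to an interval is what allows the same routine to be reused with the class-specific intervals defined later in this section.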

For the MEP method, the optimal threshold value is
determined by maximizing the a posteriori entropy subject to
certain  inequality  constraints  that  are  derived  by  means  of 
special  measures  characterizing  the  uniformity  and  the  shape 
of  the  regions  in  the  image.  As  is  well-known,^®  the  maxi¬ 
mum  a  posteriori  probability  can  serve  as  a  criterion  to  se¬ 
lect  a  priori  probability  distributions  when  very  little  is 
known  about  the  probability  distribution.  Compared  with  the 
DA  method,  MEP  can  provide  a  better  thresholding  result  if 
the  gray  level  histogram  does  not  have  a  bimodal  distribu¬ 
tion.  A  more  detailed  description  of  the  MEP  method  can  be 
found  in  Appendix  B. 

The gray level histograms of the mammograms in our study are
very complex; a histogram may be unimodal, bimodal, or
multi-modal. It is difficult to select an appropriate threshold
by one general threshold selection method. Therefore, we combined
the DA and the MEP methods to select a threshold according to the
characteristic features of the histogram, which has been
classified into one of the four classes. Suppose $f(g)$ is the
gray level histogram of the breast area. Let
$T = \mathrm{Method}\bigl(f(g) \mid g_1 \le g \le g_2\bigr)$
represent the threshold, $T$, that is selected by use of Method
in the interval $[g_1, g_2]$ of the histogram $f(g)$, where
Method can be either the DA or the MEP method. The settings of
the interval $[g_1, g_2]$ for the four classes are discussed
below and shown in Fig. 3.

Class I: The histogram is unimodal, so the threshold is
selected as

$$T = \mathrm{MEP}\bigl(f(g) \mid g_1 \le g \le g_2\bigr),$$

where $g_1$ is the main peak point and $g_2$ is the valley point
on the right side of the main peak.

Class II: The histogram is not unimodal and is classified as
Class II; the threshold is selected by averaging two thresholds
that are computed in two different intervals of the histogram by
the DA method:

$$T_1 = \mathrm{DA}\bigl(f(g) \mid g \ge g_1\bigr),$$

$$T_2 = \mathrm{DA}\bigl(f(g) \mid g \ge g_2\bigr),$$

$$T = (T_1 + T_2)/2,$$

where $g_1$ is the valley on the left of the main peak and $g_2$
is the main peak point.

Class III: The histogram is not unimodal; there are two
possibilities in the histogram distribution: there is a valley
between the main peak and its left-side peak, as shown in
Fig. 3, or no obvious valley exists between the main peak and
its left-side peak. In two different intervals of the histogram,
two thresholds are computed as

$$T_1 = \mathrm{DA}\bigl(f(g) \mid g_1 \le g \le g_2\bigr),$$

$$T_2 = \mathrm{DA}\bigl(f(g) \mid g_1' \le g \le g_2\bigr),$$

where $g_1$ is the left valley point of the left-side peak
($P_{LM}$) of the main peak, $g_1'$ is the peak point of
$P_{LM}$, and $g_2$ is the right valley point of the main peak.
If there is an obvious valley, $T = (T_1 + T_2)/2$; otherwise
$T = T_1$.

Class IV: Since the histogram is considered unimodal, the
threshold is computed by the MEP method,
$T = \mathrm{MEP}\bigl(f(g) \mid g_1 \le g \le g_2\bigr)$, where
$g_1$ is the left valley point of the main peak and $g_2$ is the
main peak point.
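
The class-dependent choice of method and interval can be summarized as a small dispatcher. The sketch below is one reading of the rules above: the DA and MEP routines are passed in as callables taking a histogram and an interval, and the landmark names in the dictionary are hypothetical labels for the peak and valley points, not names used in the original work.

```python
def select_threshold(hist, cls, lm, da, mep):
    """Pick a threshold from the histogram class and its landmarks.

    lm  : dict of landmark gray levels (hypothetical key names)
    da  : callable (hist, g1, g2) -> threshold, discriminant analysis
    mep : callable (hist, g1, g2) -> threshold, maximum entropy principle
    """
    top = len(hist) - 1
    if cls == 1:    # unimodal, fatty: main peak to its right valley
        return mep(hist, lm["main_peak"], lm["main_right_valley"])
    if cls == 2:    # smaller peak on the right: average two DA thresholds
        t1 = da(hist, lm["main_left_valley"], top)
        t2 = da(hist, lm["main_peak"], top)
        return (t1 + t2) / 2
    if cls == 3:    # smaller peak on the left
        t1 = da(hist, lm["left_peak_left_valley"], lm["main_right_valley"])
        if not lm.get("obvious_valley", False):
            return t1
        t2 = da(hist, lm["left_peak"], lm["main_right_valley"])
        return (t1 + t2) / 2
    if cls == 4:    # unimodal, dense: left valley to the main peak
        return mep(hist, lm["main_left_valley"], lm["main_peak"])
    raise ValueError("histogram class must be 1, 2, 3 or 4")
```

Passing the two methods as arguments keeps the interval logic separate from the thresholding criteria, so either routine can be replaced without touching the class rules.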

D.  Radiologists’  segmentation  of  dense  breast  tissue 

In  order  to  evaluate  the  accuracy  of  the  computer  segmen¬ 
tation  method,  the  computer  segmentation  results  were  com¬ 
pared  to  radiologists’  manual  segmentation  in  the  data  set  of 
65  patient  cases.  Details  of  the  observer  study  for  estimation 
of  the  breast  density  and  statistical  analysis  of  the  results 
were  discussed  elsewhere. Briefly,  a  graphical  interface 
was  developed  for  displaying  the  mammograms  and  record¬ 
ing  the  observer’s  evaluation.  The  CC-view  and  MLO-view 
mammograms  for  a  given  breast  were  displayed  side-by- 
side;  a  radiologist  observer  examined  the  mammograms  and 
gave  a  BI-RADS  rating  and  a  visual  estimation  of  the  per¬ 
cent  breast  density  with  10%  increments.  After  the  subjective 
evaluation,  each  view  was  displayed  sequentially,  together 
with  the  histogram  of  the  dynamic-range-compressed  image. 
The radiologist would interactively choose a threshold by
moving a slider along the abscissa of the histogram plot.
The  segmented  binary  image,  displayed  side-by-side  with  the 
mammogram,  would  change  instantaneously  when  the 
threshold  was  changed.  The  radiologist  could  inspect  if  the 
segmented  area  corresponded  to  the  dense  area  on  the  mam¬ 
mogram.  Once  the  radiologist  was  satisfied  with  the  segmen¬ 
tation  of  the  dense  area,  the  gray  level  threshold  and  the 
percent  dense  area  derived  from  this  threshold  were  re¬ 
corded.  The  display  then  moved  to  the  next  view  of  the  same 
breast  for  evaluation.  The  mammograms  of  the  other  breast 
for  the  same  patient  would  then  be  displayed  and  evaluated 
in  the  same  way.  The  entire  process  was  repeated  for  each 
patient  until  all  patients  in  the  data  set  were  evaluated. 
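
Once a threshold has been chosen, whether interactively or by the computer, the percent dense area follows directly from the breast binary map. A minimal sketch, assuming that pixels with gray levels above the threshold are counted as dense and using a hypothetical function name:

```python
def percent_dense_area(img, mask, threshold):
    """Percentage of breast pixels (mask nonzero) above the threshold."""
    breast = dense = 0
    for row_img, row_mask in zip(img, mask):
        for v, m in zip(row_img, row_mask):
            if m:
                breast += 1
                dense += v > threshold
    return 100.0 * dense / breast
```

Pixels outside the breast map, including the trimmed pectoral muscle on MLO views, contribute to neither the numerator nor the denominator.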

Five MQSA-approved radiologists participated in the
experiment. To familiarize the radiologists with the procedures
and  to  assist  them  in  their  visual  estimation  of  the  percent 
breast  density,  we  had  them  trained  on  a  separate  set  of  25 
patient  cases  prior  to  the  evaluation  of  the  actual  data  set. 
During  the  training  session,  the  computer  displayed  the  per¬ 
cent  breast  dense  area  to  the  radiologist,  which  was  obtained 
by  the  radiologist’s  interactive  thresholding  of  the  image. 
The  radiologist  could  then  compare  the  manually  segmented 
percentage  with  their  visually  assessed  percent  density  for 
the  image.  This  feedback  helped  “calibrate”  the  radiolo¬ 
gists’  visual  estimates  of  the  percent  dense  breast  area.  The 
percent  dense  area  obtained  by  interactive  thresholding  was 
not  displayed  during  the  actual  study. 

III.  RESULTS 

An example of a typical mammogram from each of the four
classes and its corresponding enhanced image, its histogram,
the selected threshold, and the segmented image are shown in
Figs. 5(a)-5(d), respectively.

The average percent breast density obtained from manual
segmentation by the five trained radiologists for each mammogram
was used as the “true standard” of the percent breast density
for that mammogram. The breast region was segmented by the breast
boundary tracking technique, and the pectoral muscle was trimmed
for the MLO-view mammograms. The breast boundary was accurately
tracked on 92.3% (240/260) of the mammograms, and the pectoral
muscle was correctly trimmed on 74.6% (97/130) of the MLO views.
The histograms of 6% (8 CC views and 8 MLO views) of the breast
regions did not exhibit the typical characteristic features of
the four classes and were misclassified by the computer,
resulting in poor segmentation of the dense region.

Figure  6  shows  a  comparison  of  the  percent  breast  density 
visually  estimated  by  radiologists  against  the  true  standard 
for  the  94%  of  the  260  mammograms  that  were  classified 
correctly by the computer. Table I summarizes the comparison
of the radiologists’ visual estimates with the true standard.
The “difference” between the estimated % breast density and
the true standard was calculated for each case, and
the  mean  and  the  standard  deviation  of  this  difference  over 
all  cases  were  estimated  for  each  radiologist  and  shown  in 
the  table.  Therefore,  the  mean  difference  was  the  average 
bias  of  the  estimated  %  breast  density  from  the  true  standard 
over  all  images  in  the  data  set.  It  can  be  seen  that  almost  all 
radiologists  had  a  positive  bias,  on  average,  when  they  visu¬ 
ally  estimated  mammographic  density,  except  for  Radiolo¬ 
gist  5  who  had  a  small  negative  average  bias  on  the  CC-view 
reading.  For  a  given  radiologist,  the  over-estimation  in¬ 
creased  as  the  breast  density  increased.  Although  the  corre¬ 
lation  coefficients  were  high,  ranging  from  0.90  to  0.95,  the 
deviations  from  the  diagonal  line  were  systematic.  The  aver¬ 
age  bias  from  the  true  standard  varied  from  less  than  1%  to 
11%,  depending  on  the  radiologist.  The  root-mean-square 
(RMS)  errors  of  the  five  radiologists  relative  to  the  true  stan¬ 
dard  ranged  from  7.5%  to  16.3%. 
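
The summary statistics reported for each radiologist can be computed as sketched below. The sketch assumes that the standard deviation of the difference is the sample standard deviation (divisor n - 1) and that the correlation is the Pearson coefficient; neither choice is stated in the text, and the function name is hypothetical.

```python
from math import sqrt


def agreement_stats(estimates, truth):
    """Mean difference (bias), its std. dev., RMS error, and correlation."""
    n = len(truth)
    diffs = [e - t for e, t in zip(estimates, truth)]
    mean = sum(diffs) / n
    std = sqrt(sum((d - mean) ** 2 for d in diffs) / (n - 1))
    rms = sqrt(sum(d * d for d in diffs) / n)
    me, mt = sum(estimates) / n, sum(truth) / n
    cov = sum((e - me) * (t - mt) for e, t in zip(estimates, truth))
    corr = cov / sqrt(sum((e - me) ** 2 for e in estimates) *
                      sum((t - mt) ** 2 for t in truth))
    return {"mean_difference": mean, "std_difference": std,
            "rms_error": rms, "correlation": corr}
```

Because the RMS error combines the bias and the spread of the differences, a reader with a high correlation can still have a large RMS error when the bias is systematic, which is the pattern described above.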

Figure  7  shows  the  comparison  of  the  percent  breast  den¬ 
sity  between  the  computer  segmentation  and  the  true  stan¬ 
dard  for  the  94%  of  mammograms  whose  histograms  were 
considered  to  be  correctly  classified.  There  was  a  trend  of 
over-estimation  in  the  very  fatty  breasts.  In  the  medium 
dense  range,  the  variances  from  the  true  standard  were  high. 
Some  images  had  a  large  deviation  from  the  diagonal  line, 
indicating  that  the  threshold  was  incorrectly  determined. 
Table  II  summarizes  the  comparison  between  the  computer 
performance  and  the  true  standard.  For  the  CC  views  with 
correct  histogram  classification,  the  correlation  between  the 
computer-estimated  percent  dense  area  and  the  true  percent 
breast  density  was  0.94,  and  between  the  computer  and  the 
radiologists’  average  visual  estimate  was  0.87  (not  plotted). 
These  correlation  coefficients  were  0.91  and  0.82,  respec¬ 
tively,  for  the  MLO  views  with  correct  classification.  Al¬ 
though  the  correlation  coefficients  of  the  computer  segmen¬ 
tation  with  the  true  standard  were  not  better  than  those  of  the 
visual  estimates,  the  average  biases  of  the  computer  segmen¬ 
tation  from  the  true  standard  were  less  than  2%,  which  were 
substantially  less  than  those  of  visual  estimates  (Table  I). 
This  indicates  that  computerized  segmentation  is  a  good  al¬ 
ternative  to  manual  segmentation  although  variances  of  the 
automated  method  will  need  to  be  further  reduced.  The  RMS 
Fig. 5. Four classes of typical mammograms and corresponding enhanced and segmented image, histogram and threshold.


errors  of  the  computer  segmentation  were  also  less  than 
those  of  the  radiologists’  visual  estimates,  at  6.1%  and  7.2%, 
respectively,  for  the  CC  view  and  MLO  view,  when  the  his¬ 
tograms  were  correctly  classified.  The  biases  and  RMS  errors 
for  the  different  subsets  of  images  are  also  shown  in  Table  II. 
It  can  be  seen  that  correct  histogram  classification  was  the 
most  important  factor  in  reducing  the  biases  and  the  RMS 
errors.  The  contributions  by  breast  boundary  detection  and 
pectoral  muscle  segmentation  were  minor,  on  average,  for 
improving  the  estimation  of  the  percent  dense  breast  area. 

Figure 8 shows the comparison of the individual radiologists’
manual segmentation against the true standard. For CC views, the
RMS difference in the percent breast density between an
individual radiologist’s manual segmentation and the true
standard varied from 2.9% to 5.9% among the five radiologists.
For MLO views, the RMS difference varied from 2.8% to 6.2%. The
average biases of the five radiologists ranged from -2.8% to
2.2% for the CC views and from -3.1% to 3.0% for the MLO views.
The maximum biases of the five radiologists varied from 4.4% to
22.6% for the CC views and from 5.2% to 23% for the MLO views.

The five radiologists provided BI-RADS density ratings for
each breast. Although the BI-RADS ratings exhibited large
inter-observer variations,^® it is interesting to compare the
computer’s histogram classification with the BI-RADS ratings.
Since there were 260 images, each with 5 radiologists’ ratings,
there were a total of 1300 rating comparisons. The comparison of
the computer and the radiologists’ BI-RADS ratings is shown in
Table III. It was found that 87.4% of Class I classifications
had BI-RADS ratings 1 or 2, 92.0% of Class II classifications
had density ratings 2 or 3, 83.4% of Class III classifications
had density ratings 3 or 4, and 57.1% of Class IV classifications
had density rating 4. More detailed analysis of the variability
of radiologists’ BI-RADS ratings was discussed by Martin et al.

Fig. 6. A comparison of the percent breast density between five radiologists’
visual estimates and the true standard. The dashed line represents the linear
regression of all data points on the plot. The MLO view is shown. The trend
for the CC view is similar.

IV.  DISCUSSION 

Radiologists  routinely  estimate  mammographic  breast 
density  using  the  four  BI-RADS  categories.  In  studies  that 
require breast density estimation, radiologists’ visual
estimates of mammographic density were often used as the density
measure. Our observer study indicates that interobserver
variation between the BI-RADS ratings of five experienced
radiologists ranged from -1 to +1. The subjectively estimated
percent dense area can deviate from the true standard
by  as  much  as  40%,  as  shown  in  Fig.  6.  These  results  indi¬ 
cate  the  need  to  develop  an  objective  method  for  the  estima¬ 
tion  of  mammographic  breast  density  in  order  to  improve  the 
accuracy  and  reproducibility  of  the  estimation.  A  computer¬ 
ized  image  analysis  method  for  mammographic  breast  den¬ 
sity  estimation  will  be  a  useful  tool  for  study  of  breast  cancer 
risk  factors  and  for  monitoring  the  change  of  breast  cancer 
risk  with  preventive  or  interventional  treatments. 

In this study, we used the average of the percent breast
area obtained with interactive thresholding by five experienced
radiologists as the true standard. The gray level thresholding
method used in this study could achieve a reasonable
segmentation of the dense areas on the mammogram because
the image was preprocessed with dynamic range compression.
The image-based analysis of breast density will not
provide the actual percentage of fibroglandular tissue in the
breast  volume.  However,  the  previous  studies  that  estab- 


Table I. A comparison of the radiologists’ visual estimate of mammographic breast density with the true
standard. The “difference” was defined as the difference between the estimated % breast density and the true
standard for each case, and the mean and the standard deviation of this difference are tabulated.

Image subsets          No. of   Radiologist   Correlation   RMS      Mean         Std. dev. of
                       images                               error    difference   difference

CC view:
  All                  130      Rad. 1        0.942         13.3%     6.9%        11.5%
                                Rad. 2        0.931         14.5%     9.8%        10.7%
                                Rad. 3        0.923         13.3%     6.3%        11.8%
                                Rad. 4        0.934          7.5%     2.9%         7.0%
                                Rad. 5        0.901          9.6%    -1.4%         9.6%
  Histogram correctly  122      Rad. 1        0.946         13.7%     7.2%        11.3%
  classified                    Rad. 2        0.936         14.7%    10.3%        10.8%
                                Rad. 3        0.929         14.2%     6.7%        11.6%
                                Rad. 4        0.929          7.7%     3.1%         7.1%
                                Rad. 5        0.900          9.7%    -1.3%         9.4%

MLO view:
  All                  130      Rad. 1        0.933         14.5%     8.3%        12.0%
                                Rad. 2        0.914         16.1%    11.2%        11.5%
                                Rad. 3        0.915         14.4%     7.7%        12.2%
                                Rad. 4        0.919          8.8%     4.3%         7.7%
                                Rad. 5        0.910          9.2%     0.1%         9.2%
  Histogram correctly  122      Rad. 1        0.932         15.0%     8.3%        12.0%
  classified                    Rad. 2        0.914         16.3%    10.9%        11.4%
                                Rad. 3        0.919         14.7%     7.8%        12.2%
                                Rad. 4        0.916          9.0%     4.3%         7.7%
                                Rad. 5        0.909          9.4%     0.3%         9.2%

Fig. 7. A comparison of the percent breast density between the computer
segmentation and the true standard. The dashed line represents the linear
regression of the data on the plot. (a) CC view, (b) MLO view.


lished  the  correlation  between  breast  density  and  breast  can¬ 
cer  risk  were  all  based  on  mammographic  density.  This  in¬ 
dicated  that  mammographic  density  is  a  sufficiently  sensitive 
marker  for  breast  cancer  risk,  although  it  may  be  less  accu¬ 
rate  than  volumetric  density.  An  actual  measurement  of  the 
percentage  of  fibroglandular  tissue  volume  in  the  breast,  for 
example,  by  x-ray  penetration  with  correction  for  scatter  and 
beam  hardening,  is  difficult  because  it  requires  accurate 
x-ray  sensitometry  or  phantom  calibration  for  each  image. 
These  requirements  will  limit  its  use  to  a  few  laboratories 
that  have  specialized  equipment  and  expert  physicists.  Mag¬ 
netic  resonance  breast  imaging  can  also  provide  volume 
measurement  of  dense  tissue  but  it  is  expensive  and  not  eas¬ 
ily  accessible.  It  can  be  expected  that  the  estimation  of  mam¬ 
mographic  breast  density  by  a  computerized  image  analysis 
method  will  be  a  more  practical  and  viable  approach,  espe¬ 


cially  when  direct  digital  mammography  becomes  more 
widely  used  in  the  future. 

Our preliminary study indicates that breast density estimation can be performed automatically
and accurately (Fig. 7). Although the accuracy of our current algorithm still needs to be
improved, it can be seen that the computer segmentation can provide an estimate of the percent
breast density with a very small bias (Table II). More importantly, computer segmentation will
be more reproducible and consistent than visual estimates. This will improve the sensitivity of
studies that depend on evaluation of the change in mammographic density over time or before and
after a certain treatment.
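
The agreement statistics tabulated in Tables I and II (correlation, RMS error, and the mean and standard deviation of the per-case difference) can be illustrated with a short sketch. The function name `agreement_stats` is hypothetical, and two details are assumptions rather than statements from the paper: the RMS error is taken as the root mean square of the per-case difference, and the standard deviation uses the sample (n - 1) form.

```python
import math


def agreement_stats(estimated, reference):
    """Return (correlation, rms_error, mean_diff, std_diff) for paired
    percent-density values. 'difference' = estimated - reference, as in
    the table captions."""
    n = len(estimated)
    if n != len(reference) or n < 2:
        raise ValueError("need two equally long sequences with at least 2 cases")

    diffs = [e - r for e, r in zip(estimated, reference)]
    mean_diff = sum(diffs) / n  # bias of the estimate
    # Assumption: sample standard deviation (n - 1 in the denominator).
    std_diff = math.sqrt(sum((d - mean_diff) ** 2 for d in diffs) / (n - 1))
    # Assumption: RMS error is the root mean square of the differences.
    rms_error = math.sqrt(sum(d * d for d in diffs) / n)

    # Pearson correlation coefficient between the two sets of values.
    mean_e = sum(estimated) / n
    mean_r = sum(reference) / n
    cov = sum((e - mean_e) * (r - mean_r) for e, r in zip(estimated, reference))
    var_e = sum((e - mean_e) ** 2 for e in estimated)
    var_r = sum((r - mean_r) ** 2 for r in reference)
    if var_e == 0 or var_r == 0:
        raise ValueError("correlation is undefined for constant input")
    correlation = cov / math.sqrt(var_e * var_r)
    return correlation, rms_error, mean_diff, std_diff
```

A small mean difference with a large standard deviation describes an estimator that is nearly unbiased on average but scattered case by case, which is the pattern the "All" rows of Table II show for the computer segmentation.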

In this study, we reduced the spatial resolution to a pixel size of 800 µm × 800 µm for image
processing. The small matrix size of the reduced images improves the computational efficiency.
The reduction in resolution has two major effects: reducing the image noise and blurring the
details. Since the significant dense tissue in the breast that contributes to the parenchyma is
relatively large compared to 800 µm, it is not expected that processing at this pixel size will
have a strong effect on the accuracy of the estimated percent breast density. Differences in
the segmented area may occur mainly along the boundary of the dense tissue region, but the
effect may be averaged out statistically along boundaries of reasonable lengths. The residual
errors in the estimation of the dense area should not be substantial in comparison with the
inter- and intra-radiologists’ variations in their manual segmentation.

Successful segmentation of dense tissue depends strongly on whether a mammogram can be
classified correctly into a proper class. A successful classification will likely result in
the selection of a near optimal threshold. Conversely, if a mammogram is classified into a
wrong class, the threshold will be selected incorrectly. For the mammograms of very fatty
breasts, the gray level histogram has the characteristics of Class I, which contains one large
single peak. These histograms can be distinguished relatively easily from most of the other
classes of histograms if those histograms exhibit the typical features. For mammograms of
BI-RADS category 2 or 3, there are scattered fibroglandular or heterogeneous densities in the
breast. A small peak may be located on the left or on the right, or on both sides of the main
peak on the histogram. The histogram could be classified into Class I if the small peak is not
large enough and is not detected as a second peak. Otherwise, it would be classified into
Class II or Class III, depending on the location of that small peak relative to the main peak
of the histogram. For the two-peak pattern histogram, the DA threshold selection method is
robust if there is an obvious valley between the two peaks. If the valley is flat or not
obvious, averaging the two thresholds obtained by the DA method in two different intervals, as
designed for this study, can reduce the chance of calculating an incorrect threshold that
differs greatly from the optimum, but it also reduces the chance of finding the optimal
threshold. Overall, the rules designed for classification of the two-peak patterns seem to
perform consistently well for this data set. One of the difficult situations is to distinguish
between Class I and Class IV, when the histogram of a very dense breast mimics that of a very
fatty breast, as shown in Fig. 9. This image was correctly classified with the additional
features, Std and NSH. However, there were other cases that failed in spite of the additional
criteria. The large difference in the optimal threshold locations between these two classes
will lead to a large error in the estimated percent breast density if the histogram is
misclassified. Further study is needed to more accurately distinguish these two classes.


Table II. A comparison of computer segmentation with the true standard. The “difference” was defined as the
difference between the estimated % breast density and the true standard for each case, and the mean and the
standard deviation of this difference are tabulated.

Image subsets                                             No. of images  Correlation  RMS error  Mean difference  Std. dev. of difference
CC view:
  All                                                     130            0.746        12.3%      1.3%             12.3%
  Boundary correctly tracked                              120            0.780        11.4%      1.4%             11.4%
  Histogram correctly classified                          122            0.943        6.1%       0.2%             6.2%
  Boundary and histogram correctly done                   113            0.953        5.6%       0.8%             5.6%
MLO view:
  All                                                     130            0.780        11.6%      1.9%             11.5%
  Boundary correctly tracked                              120            0.766        11.9%      2.1%             11.7%
  Histogram correctly classified                          122            0.914        7.2%       1.5%             7.1%
  Pectoral muscle correctly trimmed                       97             0.733        11.6%      1.6%             11.6%
  Boundary and histogram correctly done                   112            0.912        7.2%       1.7%             7.1%
  Boundary, histogram and pectoral muscle correctly done  83             0.891        7.1%       1.9%             6.8%

The dynamic range reduction technique reduces the variability of the gray level histograms and
enhances their characteristics. This pre-processing facilitates the classification of the image
into the correct class. There are many image smoothing techniques published in the literature.
Low-pass filtering with a box filter is the simplest choice. The effectiveness of background
correction with a box filtered image depends on the box size. We found that a 35 × 35-pixel
filter is a good balance between computation time and the capability to remove the high
frequency components. The subtraction of the low-pass filtered image from the original image is
a form of unsharp masking. The breast boundary is generally enhanced as shown in Fig. 2(e). The
pixels at the enhanced breast boundary contribute a small peak to the left tail of the gray
level histogram of the breast area. Moreover, if dense tissue is present close to the breast
boundary, it may not be segmented correctly due to intensity reduction. Other low frequency
estimation techniques such as wavelet decomposition will be investigated in future studies.


Fig. 8. A comparison of the percent breast density obtained from the five
radiologists’ manual segmentation with their average for the same mammograms.
The MLO view is shown. The trend for the CC view is similar.

In this feasibility study, we used a small data set of mammograms to develop a rule-based
classifier for the histogram analysis. Although a large fraction of the histograms manifest
characteristic features that can be grouped into four classes, corresponding approximately to
the four BI-RADS breast density ratings, there are many exceptions. One such example is shown
in Fig. 9. This causes misclassification and incorrect thresholding by the histogram
classifier. It will be necessary to investigate if other classification strategies can be more
effective than a rule-based method. Furthermore, we have not performed a systematic study to
optimize the many parameters used in the segmentation algorithm. Further work will be required
to investigate the dependence of the segmentation accuracy on the various parameters. The
parameter selection and the performance of the computer classifier will have to be improved by
training with a larger data set and its generalizability evaluated with unknown cases. The
generalization of the algorithm to images acquired with other digitizers or direct digital
mammography systems will also need to be investigated.


Table III. A comparison of computer classification and radiologists’
BI-RADS breast density ratings.

Computer classification  BI-RADS 1     BI-RADS 2     BI-RADS 3     BI-RADS 4     Total
Class I                  210 (16.2%)   262 (20.2%)   52 (4%)       16 (1.2%)     540 (41.5%)
Class II                 0 (0%)        92 (7.1%)     184 (14.2%)   24 (1.8%)     300 (23.1%)
Class III                1 (0.1%)      52 (4%)       167 (12.8%)   100 (7.7%)    320 (24.6%)
Class IV                 5 (0.4%)      12 (0.9%)     43 (3.3%)     80 (6.2%)     140 (10.8%)
Total                    216 (16.6%)   418 (32.2%)   446 (34.3%)   220 (16.9%)   1300 (100%)


Fig. 9. The gray level histograms of two mammograms classified by radiologists as BI-RADS rating 1 (upper
mammogram) and BI-RADS rating 4 (lower mammogram). The shapes of the histograms are very similar and cannot be
distinguished by our current histogram analysis method. These two examples were correctly classified with the
additional Std and NSH criteria.

V.  CONCLUSION 

We are developing an image analysis method for automated segmentation of the dense area from
mammograms and estimation of the percent mammographic density. Our preliminary study indicates
the feasibility of our approach. The computer-estimated mammographic breast density correlates
closely with the average manual segmentation by five experienced radiologists, and the average
bias is much less than that of the radiologists’ visual estimation. We have found that correct
classification of the histogram shapes is the most crucial step in our approach. The histograms
of many mammograms have distinctive characteristics that can be recognized by a rule-based
classifier. However, some histograms deviate from these rules and this can lead to
misclassification. Further investigation will be needed to design more robust rules or
classifiers to improve the classification accuracy. Despite these limitations, we have
demonstrated in this preliminary study that the estimation of mammographic density can be
performed efficiently and accurately by the automated image analysis tool. The fully automated
algorithm can provide an objective and reproducible quantitative estimation of mammographic
breast density that is expected to be superior to subjective visual assessment and comparable
to manual segmentation by radiologists.

ACKNOWLEDGMENTS 

This work is supported by USPHS Grant No. CA48129, U.S. Army Medical Research and Materiel
Command grants (DAMD 17-99-1-9294) and (DAMD 17-01-1-0326) and by a Career Development Award
(B.S.) from the USAMRMC (DAMD 17-96-1-6012). The content of this paper does not necessarily
reflect the position of the government and no official endorsement of any equipment and product
of any companies mentioned in this paper should be inferred.

APPENDIX  A:  GRAY-LEVEL 

THRESHOLDING— DISCRIMINANT  ANALYSIS  (DA) 

METHOD 

Suppose the probability of gray level $i$, which occurs in $n_i$ pixels of an image
with $L$ gray levels, can be estimated as

$$p_i = n_i / N, \qquad N = \sum_{i=1}^{L} n_i. \tag{A1}$$

If the pixels in the image are classified into two classes $C_0$
and $C_1$ by the threshold $k$, then the probabilities of class
occurrence and the class mean levels are given by

$$\omega_0 = \sum_{i=1}^{k} p_i = P(k), \qquad \omega_1 = \sum_{i=k+1}^{L} p_i = 1 - P(k), \tag{A2}$$

$$\mu_0 = \sum_{i=1}^{k} i p_i / \omega_0 = \mu(k) / \omega_0, \qquad \mu_1 = \sum_{i=k+1}^{L} i p_i / \omega_1 = \frac{\mu_T - \mu(k)}{1 - P(k)}, \tag{A3}$$

where

$$P(k) = \sum_{i=1}^{k} p_i, \qquad \mu(k) = \sum_{i=1}^{k} i p_i \qquad \text{and} \qquad \mu_T = \sum_{i=1}^{L} i p_i, \tag{A4}$$

are the zeroth- and the first-order cumulative moments of the
histogram up to the $k$th level, and the total mean level of
the original image, respectively.

The between-class variance is defined as

$$\sigma_B^2(k) = \omega_0 (\mu_0 - \mu_T)^2 + \omega_1 (\mu_1 - \mu_T)^2 = \frac{[\mu_T P(k) - \mu(k)]^2}{P(k)[1 - P(k)]}, \tag{A5}$$

and the optimal threshold $k^*$ is given by

$$\sigma_B^2(k^*) = \max_{k} \sigma_B^2(k). \tag{A6}$$
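
The threshold selection in Eqs. (A1) to (A6) can be sketched in a few lines. This is an illustrative implementation under stated assumptions, not the authors' code: the function name `da_threshold` is hypothetical, gray levels are numbered 1 to L as in Eq. (A1), the search runs over k = 1 to L - 1 so that both classes can be non-empty, and ties are resolved in favor of the lowest level.

```python
def da_threshold(hist):
    """Return the gray level k* (1-based) that maximizes the between-class
    variance of Eq. (A5), or None if every candidate leaves a class empty.

    hist[i - 1] is the pixel count n_i at gray level i, i = 1..L.
    """
    total = sum(hist)                      # N in Eq. (A1)
    if total == 0:
        raise ValueError("empty histogram")
    num_levels = len(hist)                 # L
    p = [n / total for n in hist]          # p_i = n_i / N
    mu_total = sum((i + 1) * p[i] for i in range(num_levels))  # mu_T in Eq. (A4)

    best_k, best_var = None, -1.0
    cum_p = 0.0    # P(k), zeroth-order cumulative moment
    cum_mu = 0.0   # mu(k), first-order cumulative moment
    for k in range(1, num_levels):         # assumption: k = 1 .. L-1
        cum_p += p[k - 1]
        cum_mu += k * p[k - 1]
        denom = cum_p * (1.0 - cum_p)
        if denom <= 0.0:
            continue                       # one class is empty at this k
        var_between = (mu_total * cum_p - cum_mu) ** 2 / denom
        if var_between > best_var:         # strict '>' keeps the lowest k on ties
            best_k, best_var = k, var_between
    return best_k
```

Using the closed form on the right-hand side of Eq. (A5) means only the two running sums P(k) and mu(k) are needed, so the whole search is a single pass over the histogram.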


APPENDIX  B:  GRAY-LEVEL 
THRESHOLDING— MAXIMUM  ENTROPY 
PRINCIPLE  (MEP)  METHOD 

Suppose the probability of gray level $i$, which occurs in $n_i$ pixels of an image
with $L$ gray levels, can be estimated as

$$p_i = n_i / N, \qquad N = \sum_{i=1}^{L} n_i. \tag{B1}$$

After thresholding the image by threshold $k$, the a posteriori
probability of the pixels with gray level value less than $k$ is
given by

$$F(k) = \sum_{i=0}^{k} p_i. \tag{B2}$$

The a posteriori probability of all those pixels with values
greater than or equal to $k$ is $1 - F(k)$. Thus the Shannon
entropy of the thresholded image is

$$H(F(k)) = -F(k) \log F(k) - (1 - F(k)) \log (1 - F(k)). \tag{B3}$$

The optimal threshold $k$ maximizes $H(F(k))$.
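
A minimal sketch of the MEP threshold search follows. The function name `mep_threshold` is hypothetical, the cumulative sum follows the index convention printed in Eq. (B2) (levels counted from 0 and the sum taken up to and including k), and ties are resolved in favor of the lowest level; all three are assumptions for illustration only.

```python
import math


def mep_threshold(hist):
    """Return the level k (0-based) that maximizes the two-class Shannon
    entropy H(F(k)) of Eq. (B3), or None if no level splits the pixels.

    hist[k] is the pixel count at gray level k.
    """
    total = sum(hist)
    if total == 0:
        raise ValueError("empty histogram")

    best_k, best_entropy = None, -1.0
    cumulative = 0.0                       # F(k) of Eq. (B2)
    for k, count in enumerate(hist):
        cumulative += count / total
        if cumulative <= 0.0 or cumulative >= 1.0:
            continue                       # all pixels on one side: entropy is zero
        entropy = (-cumulative * math.log(cumulative)
                   - (1.0 - cumulative) * math.log(1.0 - cumulative))
        if entropy > best_entropy:         # strict '>' keeps the lowest k on ties
            best_k, best_entropy = k, entropy
    return best_k
```

Because the two-class entropy in Eq. (B3) is symmetric about F = 0.5 and peaks there, the selected threshold is the level at which the cumulative histogram is closest to one half.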
Previous studies have found that mammographic breast density is highly correlated with breast
cancer risk. Therefore, mammographic breast density may be considered as an important risk factor
in studies of breast cancer treatments. In this paper, we evaluated the accuracy of using
mammograms for estimating breast density by analyzing the correlation between the percent
mammographic dense area and the percent glandular tissue volume as estimated from MR images. A dataset
of 67 cases having MR images (coronal 3-D SPGR T1-weighted pre-contrast) and corresponding
4-view mammograms was used in this study. Mammographic breast density was estimated by an
experienced radiologist and by an automated image analysis tool, Mammography Density ESTimator
(MDEST), developed previously in our laboratory. For the estimation of the percent volume of
fibroglandular tissue in breast MR images, a semiautomatic method was developed to segment the
fibroglandular tissue from each slice. The tissue volume was calculated by integration over all slices
containing the breast. Interobserver variation was measured for 3 different readers. It was found that
the correlation between every two of the three readers for segmentation of MR volumetric
fibroglandular tissue was 0.99. The correlations between the percent volumetric fibroglandular tissue on
MR images and the percent dense area of the CC and MLO views segmented by an experienced
radiologist were both 0.91. The correlations between the percent volumetric fibroglandular tissue on
MR images and the percent dense area of the CC and MLO views segmented by MDEST were 0.91
and 0.89, respectively. The root-mean-square (rms) residual ranged from 5.4% to 6.3%. The mean
bias ranged from 3% to 6%. The high correlation indicates that changes in mammographic density
may be a useful indicator of changes in fibroglandular tissue volume in the breast. © 2004
American Association of Physicists in Medicine. [DOI: 10.1118/1.1668512]

Key  words:  mammography,  breast  density,  MR  images,  correlation 


I.  INTRODUCTION 

Studies have shown that there is a strong positive correlation between breast parenchymal
density imaged on mammograms and breast cancer risk. The relative risk is estimated to be about
4 to 6 for women whose mammograms have parenchymal densities over 60% of the breast area, as
compared to women with less than 5% densities. Other cohort studies also found that breast
cancer risk in the category with the most extensive dense tissue was 1.8 to 6 times as high as
that in the category with the least extensive dense tissue. As a risk indicator, mammographic
density is stronger than almost all other risk factors for breast cancer. Although there is no
direct evidence that changes in mammographic breast densities will result in changes in breast
cancer risk, the strong correlation between breast density and breast cancer risk has prompted
researchers to use mammographic density for monitoring the effects of intervention as well as
for studying breast cancer etiology.

A number of researchers have investigated image analysis techniques to estimate breast density.
The common approaches are to analyze the textural pattern or the percentage of mammographic
densities relative to the breast area. It has been found that the texture measures were
correlated with parenchymal density patterns but they appeared to be less sensitive measures of
relative risk than the percent dense area. In current practice, breast density is estimated
mainly by radiologists’ visual judgment of the fibroglandular tissue imaged on mammograms
following the Breast Imaging Reporting and Data System (BI-RADS) lexicon. Because of the
qualitative and subjective nature of visual judgment, there are large intraobserver and
interobserver variations in the estimated breast density. The large variability may reduce the
observed correlation between breast cancer risk and breast density. It may also reduce the
sensitivity of studies using mammographic density for monitoring the effect of risk modifying
treatments. We have developed an automated image analysis system, Mammographic Density
ESTimator (MDEST), to assist radiologists in estimating breast density on mammograms. A
computerized analysis is expected to increase the reproducibility and consistency in the
estimation of mammographic density, thereby improving the accuracy of the related studies. In
our previous study, we found that the percent mammographic density segmented by MDEST agreed
closely with that estimated by radiologists’ interactive thresholding.

The high correlation between breast cancer risk and breast density indicates that breast cancer
risk may be closely related to the volume of glandular tissue in the breast. Among the
modalities available for breast imaging at present, magnetic resonance (MR) imaging is likely
to be the most accurate method for volumetric dense tissue estimation because fibroglandular
tissue and adipose tissue can be well distinguished in MR images when a proper image
acquisition technique is used. However, MR imaging is expensive, making it difficult to use MR
imaging as a routine monitoring tool. On the other hand, a mammogram is a two-dimensional (2-D)
projection image of a three-dimensional (3-D) object. The area of dense tissue measured on a
mammogram is not an accurate measure of the volume of fibroglandular tissue in the breast
because no thickness information is used. However, mammography is a widely available low cost
procedure that may be used for monitoring breast density change during preventive and
interventional treatment or other studies. Women who participate in screening will also have
mammograms readily available for retrospective review. Therefore, mammography will most likely
be the method of choice for breast density estimation.

In this study, we investigated the correlation between the volumetric fibroglandular tissue in
the breast and the projected breast dense area on mammograms by analyzing the percent
volumetric fibroglandular tissue in MR breast images and the percent dense area in
corresponding mammograms. Our purpose in this study is not to evaluate the usefulness of either
MR fibroglandular tissue volume or mammographic density as an indicator for breast cancer risk,
which has been studied by other investigators. Rather, we used the MR breast images to estimate
the volumetric fibroglandular tissue in the breast and explored the reason that a change in
mammographic density (2-D) can be used as an indicator of breast density change (3-D). These
comparisons will provide a better understanding of their relationship, and may lead to improved
methods for utilizing mammographic density as a surrogate marker for breast cancer risk.

II.  MATERIALS  AND  METHOD 

A.  Dataset 

In a previous study, gadolinium contrast enhanced MR dynamic imaging was employed to
characterize malignant and benign breast lesions. A dataset was collected with IRB approval
which included MR images and corresponding mammograms acquired between detection and biopsy for
a given patient. In the MR study, several series of images were acquired for each patient.
Patients were scanned prone using a commercial dual phased-array breast coil. The imaging
protocol included a coronal 3-D T1-weighted pre-contrast series (coronal sections 2-5 mm thick,
32 slices; 3-D Spoiled Gradient-Recalled Echo (SPGR); TE = 3.3 ms; TR = 10 ms; Flip = 40°;
matrix = 256 × 128; FOV = 28-32 cm right/left, 14-16 cm superior/inferior; scan
time = 2 min 38 sec). This 3-D SPGR sequence produces full volume coverage of both breasts with
contiguous image sections. The dense parenchyma and fat tissue are well separated with this
heavily T1-weighted acquisition. We used a set of 67 patients to study the correlation between
the 2-D projected percentage of dense area on a mammogram and the percentage of dense tissue
volume estimated from the 3-D MR images.

The mammograms consisting of the craniocaudal (CC) view and the mediolateral oblique (MLO) view
of both breasts of the patient were digitized with a LUMISYS 85 laser film scanner at a pixel
size of 50 µm × 50 µm. The digitizer has a gray level resolution of 12 bits and a nominal
optical density (O.D.) range of 0 to 4. For density segmentation, it is not necessary to use
very high-resolution images. To reduce processing time, the full resolution mammograms were
first smoothed with a 16 × 16 box filter and subsampled by a factor of 16, resulting in
800 µm × 800 µm images for this study.
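
The resolution reduction described above (16 × 16 box filter followed by subsampling by 16, so that 50 µm pixels become 800 µm pixels) can be sketched as a block average. This is only an illustration under two assumptions that the text does not state: the subsampling grid is aligned with the filter so that each output pixel is the mean of one non-overlapping block, and rows or columns left over at the image edge are discarded. The function name `box_downsample` is hypothetical.

```python
def box_downsample(image, factor=16):
    """Average non-overlapping factor x factor blocks of a 2-D image.

    image is a list of equally long rows of gray levels. Partial blocks
    at the right and bottom edges are dropped (an assumption).
    """
    if factor < 1:
        raise ValueError("factor must be a positive integer")
    out_rows = len(image) // factor
    out_cols = len(image[0]) // factor if image else 0

    reduced = []
    for r in range(out_rows):
        row_out = []
        for c in range(out_cols):
            block_sum = 0
            for y in range(r * factor, (r + 1) * factor):
                block_sum += sum(image[y][c * factor:(c + 1) * factor])
            row_out.append(block_sum / (factor * factor))  # block mean
        reduced.append(row_out)
    return reduced
```

With `factor=16`, a digitized film of 4096 × 3072 pixels at 50 µm would reduce to 256 × 192 pixels at 800 µm; the image size here is a made-up example, not a figure from the study.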

B. Estimation of fibroglandular tissue volume on MR images

Since it is not our intention to routinely segment MR images for breast density estimation, we
did not attempt to develop an automated method for this application. Our algorithm for
segmentation of volumetric fibroglandular tissue on MR images used a semi-automatic method. The
computer performed an initial segmentation. A graphical user interface (GUI) was developed to
allow a user to review the segmentation of every slice and make modifications if necessary. The
method consists of four steps. First, the breast boundary was detected automatically on each
slice. A deformable model and manual modification were used to correct for incorrectly detected
boundaries that usually occurred in slices near the chest wall where there were no well-defined
breast boundaries. Because of inhomogeneity of the breast coil sensitivity, the signal
intensity in the breast region was not uniform across the field of view. A background
correction technique that estimated the low frequency background from the gray levels along the
breast boundary was developed to reduce this systematic nonuniformity. Manual interactive
thresholding of the gray level histogram in the breast region was then used to separate the
fibroglandular from the fatty region. Morphological erosion was used to exclude the skin voxels
along the breast boundary. Finally, the volume of fibroglandular tissue was calculated by
integration over all slices containing the breast. A flow chart of our algorithm is
shown in Fig. 1.
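
The final step, integrating the segmented tissue over all slices, can be sketched as a voxel count. Several details below are assumptions made for illustration: each slice is passed with its own interactively chosen threshold, voxels outside the eroded breast region are marked `None`, all voxels have the same volume (so the percentage reduces to a ratio of counts), and voxels darker than the threshold are counted as fibroglandular, on the basis that fat is the brighter tissue in the T1-weighted acquisition. The function name `percent_fibroglandular_volume` is hypothetical.

```python
def percent_fibroglandular_volume(slices):
    """Return the percent fibroglandular tissue volume of one breast.

    slices is a list of (pixels, threshold) pairs, one per MR slice.
    pixels is a 2-D list holding the background-corrected gray level of
    each voxel inside the breast (skin already removed) or None outside.
    """
    breast_voxels = 0
    dense_voxels = 0
    for pixels, threshold in slices:
        for row in pixels:
            for value in row:
                if value is None:
                    continue               # outside the breast region
                breast_voxels += 1
                if value < threshold:      # assumption: dark = fibroglandular
                    dense_voxels += 1
    if breast_voxels == 0:
        raise ValueError("no breast voxels found")
    # Equal voxel volume assumed, so the volume ratio equals the count ratio.
    return 100.0 * dense_voxels / breast_voxels
```

If the slice thickness varied within a series, each count would need to be weighted by the voxel volume of its slice before taking the ratio.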

C.  Breast  boundary  detection 

A two-step algorithm was developed for the detection of the breast boundary on each slice.
First, we used a seeded pixel thresholding algorithm (SPTA) for the initial assessment of a
breast boundary. Second, a 2-D active contour algorithm further refined the boundary. For
slices close to the chest wall where no clear boundary can be seen, manual modification was
used to outline an estimated boundary.

The SPTA determined the optimal threshold by iteratively partitioning the MR image into two
parts and using the gradient value along the boundary of the partition as a guide in optimizing
the threshold. First, the center of gravity was selected as the starting pixel on each slice.
The gray level of the starting pixel was used as a threshold to create a binary partition of
the image in which all pixels greater than the threshold were set to one and all other pixels
were set to zero. Second, the gradient value of each pixel on the boundary of the binary
partition was calculated by applying the Sobel filter to the original image. The gradient
assessment for this particular binary partition was defined as the average gradient magnitude
of these boundary pixels. The threshold value was reduced to zero in a stepwise manner. The
partition for each threshold value was created and the gradient assessment for each partition
was calculated as described above. The partition with the maximum gradient assessment was
considered to be the initial segmentation result for the breast, and the boundary of this
partition was considered to be the initial breast boundary.

After the initial segmentation, a deformable contour method was used to further refine the
boundary. The movement of the boundary pixel was controlled by an energy function which
consisted of internal energy and external energy. The internal energy components used in this
study were the continuity and curvature of the contour, as well as the homogeneity of the
segmented partition. The external energy components were the negative of the smoothed image
gradient magnitude, and a balloon force that exerted pressure at a normal direction to the
contour. The energy function was defined as follows:

$$E = \sum_{c=1}^{N} \left[ E_{inter}(c) + E_{exter}(c) \right], \tag{1}$$

where $E_{inter}$ and $E_{exter}$ are the internal energy and the external energy,
respectively, as defined in Eq. (2) and Eq. (3):

$$E_{inter}(c) = w_{curv} E_{curv}(c) + w_{cont} E_{cont}(c) + w_{hom} E_{hom}(c), \tag{2}$$

$$E_{exter}(c) = w_{grad} E_{grad}(c) + w_{bal} E_{bal}(c), \tag{3}$$

where curv, cont, grad, bal, hom denote curvature, continuity, gradient, balloon force and
homogeneity, respectively, and each energy term was associated with a weight, $w$. The detailed
definition for each term can be found in the literature. An example of an MR slice of a breast
is shown in Fig. 2(a), and the segmented boundary is shown in Fig. 2(b). Note that the two
breasts of a patient were scanned together but each breast was analyzed separately.


Fig. 1. The flow chart for the segmentation of the fibroglandular tissue on
MR images.

Fig. 2. An example of the first three processing blocks in Fig. 1. (a) Original
MR slice; (b) automatically-detected breast boundary superimposed on the
image; and (c) the background-corrected image.


D.  Background  correction 

To  reduce  the  nonuniformity  of  the  MR  signal  intensity  in 
the  breast  region,  a  background  correction  technique^®  using 
the pixel values around the segmented breast region was employed. For a given
pixel $(i,j)$ inside the breast region, the gray value of the background image
was estimated as shown in Eq. (4):
where $L$, $R$, $U$ and $D$ are the average gray values inside a breast
background estimation region (BBER) centered at the left, right, upper and
lower pixels on the breast boundary, respectively. A BBER was defined as the
intersection of a 21 × 21-pixel box and the breast region. The center pixels
for the left and right boxes were the intersection points between the breast
boundary and a horizontal line passing through the given pixel $(i,j)$.
Similarly, the center pixels for the upper and lower boxes were the
intersection points between the breast boundary and a vertical line passing
through the given pixel $(i,j)$. Only the pixels that were within the
intersection of the 21 × 21-pixel box and the breast region were included in
the definition of the BBER and the calculation of the average gray value. The
contributions of the average gray levels to the background pixel $(i,j)$ were
inversely weighted by their distances $d_l$, $d_r$, $d_u$, $d_d$ from the given
pixel $(i,j)$. An example of the background corrected image is shown in
Fig. 2(c).
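The background estimate described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function names `bber_mean` and `background_value` are hypothetical, the breast mask is assumed to be a boolean array, the boundary pixels are taken as the first and last mask pixels on the row and column through $(i,j)$, and the inverse-distance weights are assumed to be normalized by their sum, since Eq. (4) is not legible in this copy.

```python
import numpy as np

HALF_BOX = 10  # a 21 x 21-pixel box extends 10 pixels on each side of its center


def bber_mean(image, mask, center):
    """Average gray value in the 21 x 21 box around `center`, restricted to the breast mask."""
    r, c = center
    r0, r1 = max(r - HALF_BOX, 0), min(r + HALF_BOX + 1, image.shape[0])
    c0, c1 = max(c - HALF_BOX, 0), min(c + HALF_BOX + 1, image.shape[1])
    inside = mask[r0:r1, c0:c1]
    return float(image[r0:r1, c0:c1][inside].mean())


def background_value(image, mask, i, j):
    """Inverse-distance-weighted background estimate at pixel (i, j) inside the mask."""
    cols = np.flatnonzero(mask[i, :])  # breast pixels on the horizontal line through (i, j)
    rows = np.flatnonzero(mask[:, j])  # breast pixels on the vertical line through (i, j)
    # Boundary centers in the order left, right, upper, lower (assumed convex breast region).
    centers = [(i, cols[0]), (i, cols[-1]), (rows[0], j), (rows[-1], j)]
    dists = np.array([j - cols[0], cols[-1] - j, i - rows[0], rows[-1] - i], dtype=float)
    means = np.array([bber_mean(image, mask, ctr) for ctr in centers])
    # Assumption: clamp distances to 1 so a pixel on the boundary does not divide by zero.
    weights = 1.0 / np.maximum(dists, 1.0)
    return float(np.sum(weights * means) / np.sum(weights))
```

Under these assumptions, a uniform image yields a background equal to that uniform value at every pixel, which is a quick sanity check for any implementation of the weighting.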

E.  Segmentation  of  fibroglandular  tissue 

We developed a GUI that allowed the user to perform a combination of manual and
automatic operations to segment the breast boundary and the fibroglandular
tissue on the MR images. The first window (not shown) displayed the MR series
and the corresponding mammogram of each breast to give the user an overview of
the breast. The segmentation of the fibroglandular tissue on each MR slice was
processed in the second window, shown in Fig. 3. The original MR slice, the
corresponding background corrected image and the segmented binary image were
shown in the upper part of the window. At the lower part of the window, the
histogram of the voxel values in the breast region was shown. The user
performed interactive thresholding on the histogram and the segmented binary
image corresponding to the chosen threshold was displayed in real time in the
upper part. If the breast boundary, which was automatically segmented by the
computer initially, had to be corrected, the user could go to the third window
and manually move the vertices of the polygon outlining the boundary. The
voxels contributed by the nipple were excluded. On the slices containing breast
skin that had voxel values similar to those of fibroglandular tissue, a
morphological erosion operation was applied to the breast boundary to exclude
the skin voxels from the calculation of the fibroglandular tissue volume in the
slice. The size of the structuring element could be selected interactively on
the fourth window and the eroded boundary was displayed instantly for a chosen
erosion operation. The user could change the structuring element again if the
erosion result of the previous choice was deemed unsatisfactory. Since the
eroded boundary only marked the region within which the fibroglandular voxels
would be summed and would not be used for the calculation of the breast volume,
as described below, it did not need to be precise as long as it excluded the
skin voxels while not excluding the fibroglandular voxels.

Fig. 3. The graphical user interface for the segmentation of the
fibroglandular tissues on the MR slice. The upper row shows the original MR
slice (left), the background-corrected image (middle) and the segmented binary
image (right). The segmented image responds to the reader's adjustment of the
gray level threshold (lower row) in real time so that the reader can choose the
appropriate threshold by inspecting the segmented image visually. The dark area
in the segmented image indicates the fibroglandular tissue and the white area
indicates the adipose tissue. The inner line along the breast boundary is the
boundary obtained by morphological erosion to exclude the skin voxels for
calculating the fibroglandular tissue volume.


F.  MR  fibroglandular  tissue  volume 

After the fibroglandular tissue was segmented for each slice, the total number
of voxels containing the fibroglandular tissue was obtained as a summation of
these voxels over all slices of the breast. The total volume of the breast was
obtained as the summation of the voxels enclosed by the breast boundary before
morphological erosion. The ratio of these two volumes provided the percent
volumetric fibroglandular tissue in the breast.
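The voxel counting described in Sections E and F can be sketched as follows. This is an illustrative sketch under stated assumptions, not the GUI code: `erode` and `percent_fibroglandular_volume` are hypothetical names, fibroglandular voxels are assumed to fall below the chosen gray level threshold, the structuring element is fixed to a 3 × 3 cross, and erosion is applied to every slice rather than only to slices where skin mimics fibroglandular tissue.

```python
import numpy as np


def erode(mask, iterations=1):
    """Binary erosion with a 3 x 3 cross structuring element, using NumPy only."""
    out = mask.copy()
    for _ in range(iterations):
        p = np.pad(out, 1, constant_values=False)
        # A pixel survives only if it and its four neighbours are all inside the mask.
        out = (p[1:-1, 1:-1] & p[:-2, 1:-1] & p[2:, 1:-1]
               & p[1:-1, :-2] & p[1:-1, 2:])
    return out


def percent_fibroglandular_volume(slices, breast_masks, thresholds, erosion_iters=1):
    """Percent of breast voxels classified as fibroglandular, summed over all slices."""
    dense_voxels = 0
    breast_voxels = 0
    for img, mask, thr in zip(slices, breast_masks, thresholds):
        inner = erode(mask, erosion_iters)  # eroded region excludes the skin voxels
        # Assumption: fibroglandular tissue is darker than fat, so it lies below the threshold.
        dense_voxels += int(np.count_nonzero((img < thr) & inner))
        # The breast volume uses the boundary before erosion, as in the text.
        breast_voxels += int(np.count_nonzero(mask))
    return 100.0 * dense_voxels / breast_voxels
```

The numerator is restricted to the eroded region while the denominator uses the full mask, which mirrors the statement that the eroded boundary is not used for the breast volume.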

G.  Mammographic  density  segmentation 

We have previously developed an automated method for segmentation of the dense
fibroglandular area on mammograms. The method, referred to as the Mammographic
Density ESTimator (MDEST), was described in detail elsewhere. In brief, the
breast boundary on the digitized mammogram is tracked. A dynamic-range
compression technique reduces the gray level range of the breast area. By
analyzing the shape of the gray level histogram, a rule-based classifier
classifies the breast density into one of four classes. Typically, a Class I
breast is almost entirely fat; it has a single narrow peak on the histogram. A
Class II breast contains scattered fibroglandular densities. Its histogram has
two main peaks, with the smaller peak on the right of the bigger one. A Class
III breast is heterogeneously dense. Its histogram also has two peaks, but the
smaller peak is on the left of the bigger one. A Class IV breast is extremely
dense. Its histogram has mainly a single dominant peak, but the peak is wider
than the peak in the Class I histogram. A second smaller peak sometimes occurs
on the left of the main peak. Based on the histogram shape, a threshold is
automatically calculated to separate the dense and fatty pixels. The
mammographic density was estimated as the percentage of fibroglandular tissue
area relative to the total breast area. For MLO view mammograms, the pectoral
muscle is detected and excluded from the density area or breast area
calculations. In our previous work, the performance of MDEST was verified by
comparison with manual segmentation by 5 breast imaging radiologists using a
dataset of 260 mammograms from 65 patients that were different from the cases
used in the current study. We found that the correlation between the
computer-estimated percent dense area and the average segmentation by the 5
radiologists was 0.94 and 0.91, respectively, for CC and MLO views, with a mean
bias of less than 2%.

Fig. 4. A comparison of the percent mammographic density obtained from
interactive thresholding by an MQSA-qualified radiologist and that estimated by
our automated MDEST computer program. (a) CC view, correlation coefficient =
0.90, rms residual = 6.7, mean difference = 0.3; (b) MLO view, correlation
coefficient = 0.89, rms residual = 6.1, mean difference = 0.4. Dashed line:
linear regression of the data; solid line: diagonal.

MDEST was applied to the mammograms of the 67 patients used in this study. The
percent dense area on mammograms was estimated for the CC-view and the MLO-view
mammogram of each breast separately. In addition, an MQSA-qualified radiologist
also segmented the dense area by interactive thresholding for each mammogram.
The correlation between the mammographic density obtained by manual and
automatic segmentation is shown in Figs. 4(a) and 4(b) for the CC view and MLO
view, respectively. The correlation coefficients for the CC view and MLO view
were 0.90 and 0.89, respectively. The mammographic densities estimated by
automatic and manual segmentation were compared with the percent volumetric
fibroglandular tissue on MR images as described below.

H.  Observer  experiments 

We performed an experiment to evaluate the variability of the estimated %
volumetric fibroglandular tissue due to the uncertainty in the determination of
the starting slice of the breast at the chest wall. The starting slice affected
the estimation of the breast volume that was calculated by integrating from the
starting slice to the anterior of the breast. Twenty-three MR cases from the
dataset were randomly selected for this observer experiment. There were a total
of 41 breasts because some cases had only one breast. For this subset of cases,
each radiologist was asked to select the starting slice from the MR images for
each breast. The estimated % volumetric fibroglandular tissue calculated with
all available slices was then compared to that calculated with the selected
starting slice.

We also performed observer experiments to evaluate the inter-observer
variations in the segmentation of fibroglandular tissue using the
semi-automatic method. Two MQSA-qualified radiologists performed the
segmentation of the fibroglandular tissue on the MR images of the 41 breasts
using the semi-automatic method implemented with the GUI. A Ph.D. researcher
who was trained by these radiologists also performed the segmentation
independently with the GUI.

After  verifying  the  consistency  of  segmentation  by  these 
observers,  the  trained  Ph.D.  completed  the  segmentation  of 
all  MR  cases.  The  correlation  between  percent  volumetric 
fibroglandular  tissue  on  MR  images  and  percent  dense  area 
on  mammograms  was  then  examined  for  the  entire  dataset. 

III.  RESULTS 

A.  Effect  of  selection  of  the  starting  slice 

Figure 5(a) shows the correlation of the % volumetric fibroglandular tissue
calculated using all available slices for the breast with that calculated using
the starting slice selected by radiologist A for the 41 breasts. The
correlation coefficient was 0.999. To quantify the difference between the two
results, the mean difference and the root-mean-square (rms) residual, which is
the residual from the linear least-squares-fitted line, were also calculated.
The mean difference was 0.7 and the rms residual was 0.6. The result was
similar for radiologist B (not shown), with a correlation coefficient of 0.999,
a mean difference of 0.4 and an rms residual of 0.4. The correlation between
the % volumetric fibroglandular tissue calculated using the starting slice
selected by radiologist A and that calculated using the starting slice selected
by radiologist B was also very high, with a correlation coefficient of 0.988, a
mean difference of 0.7 and an rms residual of 1.8, as shown in Fig. 5(b). These
comparisons indicated that the variability in the selection of the starting
slice of the breasts did not have a strong influence on the % volumetric
fibroglandular tissue. We therefore used all available slices in the MR dataset
for each breast in the following analyses.
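The three agreement measures reported throughout the Results (correlation coefficient, mean difference, and rms residual about the least-squares line) can be computed as in the sketch below. The function name `agreement_stats` is hypothetical, the mean difference is taken as `y - x`, and the rms residual is assumed to be normalized by the number of points; the source does not state either convention.

```python
import numpy as np


def agreement_stats(x, y):
    """Return (Pearson r, mean difference y - x, rms residual about the fitted line)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    r = float(np.corrcoef(x, y)[0, 1])
    mean_diff = float(np.mean(y - x))
    # Linear least-squares fit y = slope * x + intercept.
    slope, intercept = np.polyfit(x, y, 1)
    residuals = y - (slope * x + intercept)
    # Assumption: divide by N rather than N - 2 when forming the rms residual.
    rms_residual = float(np.sqrt(np.mean(residuals ** 2)))
    return r, mean_diff, rms_residual
```

Because the rms residual is measured from the fitted line rather than from the diagonal, two measurements can show a nonzero mean difference and still have a very small rms residual, as in the starting-slice comparison.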
Fig. 5. (a) A comparison of the percent fibroglandular tissue volume calculated
using the selected starting slice with that calculated using all available
slices for radiologist A, correlation coefficient = 0.999. (b) A comparison of
the percent fibroglandular tissue volume calculated using the selected starting
slice by radiologist B with that by radiologist A, correlation coefficient =
0.988. Dashed line: linear regression of the data; solid line: diagonal.


B. Inter-observer variation between radiologists

Figure 6(a) shows the comparison of the percent volumetric fibroglandular
tissues on MR images segmented by two radiologists for the 41 breasts. The
correlation between the segmentation results of the two radiologists was 0.99.
The mean difference was found to be 0.3 and the rms residual was 1.6.

C. Inter-observer variation between radiologists and trained Ph.D.

Figure 6(b) shows the comparison of the percent volumetric fibroglandular
tissues segmented by the trained Ph.D. against that segmented by radiologist A.
A similar result was obtained by comparing the percent volumetric tissue
segmented by the trained Ph.D. and that segmented by radiologist B, except that
the data points were even closer to the diagonal (not shown). The correlation
between the result of the trained Ph.D. and the results of both radiologists
was 0.99. The corresponding mean differences were -0.8 and -0.4, respectively,
and the rms residuals were 1.4 and 1.5, respectively.

D.  Correlation  between  percent  volumetric 
fibroglandular  tissue  on  MR  images  and  percent 
mammographic  density 

The percent volumetric fibroglandular tissue on MR images was compared with the
percent dense area on CC- and MLO-view mammograms. After verifying that the
difference in segmentation between the trained Ph.D. and the radiologists was
similar to the interobserver variations between the two experienced
radiologists, the trained Ph.D. completed the segmentation of the entire
dataset.

Figure  7  shows  the  comparison  of  the  percent  volumetric 
fibroglandular  tissue  on  MRI  and  the  percent  mammographic 
density  segmented  by  a  radiologist.  The  percent  areas  on  CC- 
and  MLO-view  mammograms  are  higher  than  the  percent 
volume  on  MR  images  with  a  mean  difference  of  5.7%  and 
3.0%,  respectively. 

Figure  8  shows  the  comparison  of  the  percent  volumetric 
fibroglandular  tissue  on  MRI  and  the  percent  mammographic 
density  segmented  by  MDEST.  The  percent  areas  on  CC- 
and  MLO-view  mammograms  segmented  by  the  computer 
are  higher  than  the  percent  volume  on  MR  images  with  a 
mean  difference  of  5.3%  and  2.6%,  respectively. 

The correlation coefficients, the mean differences and the rms residuals
between the percent volumetric fibroglandular tissue on MR images and percent
dense area on mammograms are compared in Table I. The correlation between the
percent volume on MR images and percent area on mammograms of the
fibroglandular breast tissue is high, ranging from 0.89 to 0.91. Although the
values of percent volume are not expected to agree with the values of percent
area, their mean differences range only from 3% to 6% and the rms residuals
range from 5.4 to 6.3.


Fig. 6. A comparison of the segmentation of fibroglandular tissue from MR
images between two observers: (a) two experienced MQSA-qualified radiologists,
correlation coefficient = 0.99. (b) The trained Ph.D. and Radiologist A,
correlation coefficient = 0.99. The correlation between the trained Ph.D. and
Radiologist B is also 0.99; the data points were very close to the diagonal and
the plot is not shown. The % volumetric fibroglandular tissue was calculated
using all available slices. Dashed line: linear regression of the data; solid
line: diagonal.
Fig. 7. A comparison of the percent fibroglandular tissue volume on MR images
and the percent dense area on mammograms segmented by an experienced
radiologist. (a) CC view, correlation coefficient = 0.91; (b) MLO view,
correlation coefficient = 0.91. Dashed line: linear regression of the data;
solid line: diagonal.


IV.  DISCUSSION 

Our purpose in this paper was to investigate the relationship between the
percent dense area on mammograms and the percent fibroglandular tissue volume
on MR images. We found a direct correlation between mammographic density and MR
volumetric density (Fig. 7 and Fig. 8). The correlation coefficients between
the percent area on a mammogram and the percent volume on MR images are high,
at 0.89 to 0.91. These results are more promising than those found in previous
studies that attempted to correlate percent dense area on mammograms with MR
information. Graham et al. investigated the relationship between percent
density (projected dense area) on mammogram and two objective MR parameters of
breast tissue, relative water content and mean T2 relaxation. Their results
with 45 cases showed a positive correlation between percent density and
relative water content (Pearson correlation coefficient = 0.79) and a negative
correlation between percent density and mean T2 value (Pearson correlation
coefficient = -0.61). Another study by Lee et al. analyzed fatty and
fibroglandular tissue in different age groups to compare x-ray mammography with
T1-weighted MR images. Their study with 40 cases indicated that the correlation
between the two techniques is 0.63 when the fat content was more than 45%.
However, the correlation coefficient decreased to 0.34 when their analysis
included only dense breasts.

It may be noted that although MR imaging is currently the most accurate method
for estimating the volumetric fibroglandular tissue in the breast, it is still
not the ideal tool. Fibrous tissue and glandular tissue are not well separated
with current MR imaging techniques. Since the amount of glandular tissue in the
breast is the important factor relating to breast cancer risk, further studies
are warranted for differentiating the glandular and the fibrous components of
the imaged volume. The correlation between the percent glandular tissue volume
and percent projected dense area on a mammogram will be a more reliable
indicator of the usefulness of mammographic density analysis.

The density on mammograms is a 2-D projected area of the fibroglandular
tissues. The percent dense area is not expected to be equal in value to the
percent volume. The mean differences between the percent volume and the percent
area on CC- and MLO-views, as determined by the radiologist's interactive
segmentation, are 5.7 and 3.0, respectively (Table I), with the percent dense
area values being higher. We also investigated the rms residual between the
percent volume and the percent area when the relationship between them was
assumed to be linear. The rms residuals between the percent volume and the
percent area on CC- and MLO-views are 6.3 and 5.6, respectively (Table I),
relative to the straight line obtained from linear least squares fits to the
data. One possible factor that may contribute to a higher value of percent
dense area on mammograms than the percent volume value on MR images is that the
tissue volume imaged by the two modalities is somewhat different. The MR images
include more tissue near the chest wall, which is mainly retroglandular adipose
tissue, than a mammogram does, thus reducing the percentage of fibroglandular
tissue volume. The reduction in the percent volume values, however, is
relatively small, as found in our study evaluating the effects of selecting
starting slices for volume calculation (Fig. 5). The main difference may
therefore be attributed to the geometric relationship between the volume and
the projected 2-D area, as explained below.

Fig. 8. A comparison of the percent volume on MR images and the percent area on
mammogram segmented by our automated MDEST computer program. (a) CC view,
correlation coefficient = 0.91; (b) MLO view, correlation coefficient = 0.89.
Dashed line: linear regression of the data; solid line: diagonal.

Table I. Statistical analysis of the relationship between percent
fibroglandular tissue volume on breast MR images and percent dense area on
mammograms segmented by radiologist and MDEST.

                      Radiologist               Computer (MDEST)
                  CC vs MRI   MLO vs MRI    CC vs MRI   MLO vs MRI
Correl. coeff.      0.91        0.91          0.91        0.89
rms residual        6.3         5.6           5.8         5.4
Mean diff.          5.7         3.0           5.3         2.6

Geometrically, we do not expect the relationship between volume and its
projected 2-D area to be linear. In a hypothetical situation in which the dense
tissue volume is a sphere (volume = $\frac{4}{3}\pi r^3$) enclosed inside a
concentric spherical shell of fatty tissue volume, the percent projected 2-D
area (area = $\pi r^2$) of the inner sphere relative to the outer sphere is
equal to the percent volume to the power of 2/3. The relationship between the
percent area and the percent volume is therefore not linear, and the percent
area is larger in value than the percent volume for any ratio of radii between
the two spheres. In general, the compressed breast and the dense tissue are not
spherical. To investigate the empirical relationship between the percent area
and the percent volume in the nonlinear situation, we applied least squares
fits in several polynomial models to the data points in Fig. 7. The results are
shown in Table II and Fig. 9. A comparison of Table I and Table II indicates
that the $Y = kx^{2/3}$ model ($x$ = percent fibroglandular tissue volume, $Y$
= percent mammographic dense area) resulted in slightly larger rms residuals
than the linear model. The model $Y = kx^{m}$ with $m$ equal to 0.83 and 0.86,
respectively, for CC- and MLO-views slightly reduced the rms residuals. The
best fit was obtained from the model $Y = k_1 x^{m} + k_2$. However, the
situation in which the percent projected area was negative when the percent
volume was zero would not occur physically. Note that if the model was fitted
to the percent area data segmented by MDEST (Fig. 8), the $k_2$ values would
become positive, indicating that the nonzero $k_2$ values are likely caused by
segmentation biases.

Table II. An analysis of the relationship between percent fibroglandular tissue
volume (x) on breast MR images and percent dense area (Y) on mammograms
segmented by radiologist using three mathematical models. m, k, k1 and k2 are
constants determined by least squares curve fitting.

Mathematical model                          Y = kx^(2/3)      Y = kx^m          Y = k1 x^m + k2
CC vs MRI    Least squares fit              Y = 0.82x^(2/3)   Y = 1.03x^0.83    Y = 1.02x^m - 0.19
             rms residual                   6.5               6.0               5.6
             Coefficient of determination   0.82              0.85              0.87
MLO vs MRI   Least squares fit              Y = 0.73x^(2/3)   Y = 0.96x^0.86    Y = 0.90x^m - 0.09
             rms residual                   6.0               5.5               5.3
             Coefficient of determination   0.80              0.84              0.85
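A power-law fit of the form $Y = kx^{m}$ can be sketched with NumPy alone, as below. This is an assumption-laden stand-in, since the source does not describe the fitting procedure: the name `fit_power_law` and the exponent grid are hypothetical, and the exponent is found by a simple grid search in which the optimal `k` for each candidate `m` is obtained in closed form from the least-squares condition.

```python
import numpy as np


def fit_power_law(x, y, exponents=np.linspace(0.30, 1.50, 121)):
    """Least-squares fit of y = k * x**m; returns (k, m, rms residual).

    For each candidate exponent m, the k minimizing sum((y - k*x**m)**2)
    is k = sum(x**m * y) / sum(x**(2m)). The best (k, m) pair is kept.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    best = None
    for m in exponents:
        xm = x ** m
        k = float(np.sum(xm * y) / np.sum(xm * xm))
        rms = float(np.sqrt(np.mean((y - k * xm) ** 2)))
        if best is None or rms < best[2]:
            best = (k, float(m), rms)
    return best
```

Passing a single exponent, for example `exponents=[2/3]`, fits the fixed-exponent model and allows its rms residual to be compared with that of the free-exponent model, in the spirit of the comparison between the first two models in Table II.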

Fig. 9. Nonlinear fitting of the relationship between the percent volume and
the percent area segmented by a radiologist with the least squares method. (a)
CC view, (b) MLO view. Dashed line: $y = kx^{2/3}$; dashed-dotted-dotted line:
$y = kx^{m}$; solid line: $y = k_1 x^{m} + k_2$. The fitted parameters of the
models, m, k, k1 and k2, are shown in Table II.

Overall, these models demonstrate that there is no simple mathematical
relationship between the percent volume and the percent projected area, but the
values for the exponents appeared to be in a reasonable range. The relationship
between the percent volumes of two 3-D objects, one within another, and their
percent projected 2-D area depends on their shapes. For example, the closer the
two volumes are to concentric cylinders of the same height, the closer the
exponent is to unity. The spread of the data points can therefore be attributed
to the various irregular shapes of the fibroglandular tissue in the breasts,
the changes in the shapes of the fatty and fibroglandular tissue due to
compression, as well as the uncertainties in the segmentation of both the
mammograms and the MR images. Although the spread of the data points in the
correlation plots is large, one can expect that when the mammographic density
of a given patient is monitored over time, the variations in the projected
dense area due to the geometric factors described above will be much less than
those observed in the scatter plots for a large number of patients. In other
words, the uncertainty in the estimated percent density from the serial
mammograms of a given patient should be much less than that shown in Fig. 7.
The strong correlation observed between the percent dense area on mammograms
and the percent volumetric fibroglandular tissue on MR images therefore
indicates that a change in mammographic density can be a useful indicator of a
change in percent fibroglandular tissue volume in the breast.
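The geometric argument for the two idealized shapes can be written out explicitly. The derivation below is a worked illustration under stated assumptions: $r$ and $R$ denote the radii of the inner (dense) and outer (whole breast) objects, $P_V$ and $P_A$ denote the fractional volume and fractional projected area, and the cylinders are assumed to be coaxial and projected along their common axis, which the source does not specify.

```latex
% Concentric spheres: inner radius r, outer radius R.
\[
P_V = \frac{\tfrac{4}{3}\pi r^{3}}{\tfrac{4}{3}\pi R^{3}} = \left(\frac{r}{R}\right)^{3},
\qquad
P_A = \frac{\pi r^{2}}{\pi R^{2}} = \left(\frac{r}{R}\right)^{2} = P_V^{2/3}.
\]
% Since 0 < P_V < 1, it follows that P_A = P_V^{2/3} > P_V for any ratio r/R.

% Coaxial cylinders of equal height h, projected along the axis.
\[
P_V = \frac{\pi r^{2} h}{\pi R^{2} h} = \left(\frac{r}{R}\right)^{2} = P_A ,
\]
% so the exponent relating P_A to P_V is exactly 1 in this case.
```

The fitted exponents of 0.83 and 0.86 fall between these two limiting cases of 2/3 and 1.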

Recently, some researchers attempted to estimate the thickness of the
fibroglandular tissue in local regions of the mammograms from the projected
density. This approach is expected to provide a more accurate estimation of the
fibroglandular tissue volume if the true thicknesses of the fibroglandular
tissue and fatty tissue can be determined at various locations of the projected
breast region. The volume of the fibroglandular tissue can then be summed over
the pixels in the breast region and the percent volume calculated. However, to
obtain accurate measurements, this approach requires knowledge of the
sensitometric curve for the screen-film mammogram at the imaging facility (or
use of a digital detector with linear response) and other physical parameters
such as the scatter fraction, the beam quality and beam hardening, in addition
to the compressed breast thickness and the breast shape profile at the
periphery. Some of the requirements may be circumvented by using a look-up
table predetermined with a phantom calibration. Other factors may have to be
approximated or ignored, or require further corrections by imaging each
mammogram with a calibration phantom placed adjacent to the breast. This method
is still being developed and the accuracy of estimating the thickness of the
local fibroglandular tissue from a mammogram is yet to be determined. To our
knowledge, no study to date has demonstrated that fibroglandular tissue volume
estimated from mammograms has a higher correlation with the percent volumetric
fibroglandular tissue estimated from MR images or other volumetric methods than
we found in our current study. Furthermore, even if the local fibroglandular
tissue thickness on mammograms can be measured in a laboratory or in an
academic center using elaborate calibration schemes, it is doubtful that these
methods can be translated into routine clinical measurement in mammography
clinics. Their use may then be limited to controlled clinical trials. An
estimation of the percent dense area projected on mammograms is likely a more
practical approach for breast density assessment. The high correlation between
the percent dense area and the percent fibroglandular tissue volume on MR
images as demonstrated in the current study further supports the validity of
this approach.

V.  CONCLUSION 

In this study, we investigated the correlation between the percent
mammographic dense area and the percent volumetric fibroglandular tissue as
measured on MR images. A semi-automatic method was developed for segmentation
of the MR images and a fully automated computerized method, MDEST, was used to
segment the mammograms. The performance of MDEST on the set of mammograms used
in this study was verified with an experienced radiologist's manual
segmentation. The inter-observer variability in segmentation of MR images was
found to be small with correlation coefficients of 0.99. The correlation
between the percent volume on MR images and percent area segmented by a
radiologist for either CC-view or MLO-view is 0.91. The correlation between
percent volume and percent area estimated by MDEST is 0.91 and 0.89,
respectively, for CC and MLO views. Mammographic density is thus highly
correlated with the percent volumetric fibroglandular tissue in the breast. The
high correlation indicates that changes in mammographic density may be a useful
indicator of changes in fibroglandular tissue volume in the breast. Our
computerized image analysis tool, MDEST, can provide a consistent and
reproducible estimation of percent dense area on routine clinical mammograms.
The automated image analysis tool may improve the sensitivity of quantifying
mammographic density changes, thereby contributing to the understanding of the
relationship of mammographic density to breast cancer risk, detection, and
prognosis, and the prevention and treatment of breast cancer.
Mammographic Density Measured with Quantitative Computer-aided Method:
Comparison with Radiologists' Estimates and BI-RADS Categories

Purpose: To retrospectively compare computer-aided mammographic density
estimation (MDEST) with radiologist estimates of percentage density and Breast
Imaging Reporting and Data System (BI-RADS) density classification.

Materials and Methods: Institutional Review Board approval was obtained for
this HIPAA-compliant study; patient informed consent requirements were waived.
A fully automated MDEST computer program was used to measure breast density on
digitized mammograms in 65 women (mean age, 53 years; range, 24-89 years).
Pixel gray levels in detected breast borders were analyzed, and dense areas
were segmented. Percentage density was calculated by dividing the number of
dense pixels by the total number of pixels within the borders. Seven breast
radiologists (five trained with MDEST, two not trained) prospectively assigned
qualitative BI-RADS density categories and visually estimated percentage
density on 260 mammograms. Qualitative BI-RADS assessments were compared with
new quantitative BI-RADS standards. The reference standard density for this
study was established by allowing the five trained radiologists to manipulate
the MDEST gray-level thresholds, which segmented mammograms into dense and
nondense areas. Statistical tests performed included Pearson correlation
coefficients, Bland-Altman agreement method, κ statistics, and unpaired t
tests.

Results: There was a close correlation between the reference standard and
radiologist-estimated density (R = 0.90-0.95) and MDEST density (R = 0.89).
Untrained radiologists overestimated percentage density by an average of 37%,
versus 6% for trained radiologists (P < .001). MDEST showed better agreement
with the reference standard (average overestimate, 1%; range, -15% to +18%).
MDEST correlated better with percentage density than with qualitative BI-RADS
categories. There were large overlaps and ranges of percentage density in
qualitative BI-RADS categories 2-4. Qualitative BI-RADS categories correlated
poorly with new quantitative BI-RADS categories, and 16 (6%) of 260 views were
erroneously classified by MDEST.

Conclusion: MDEST compared favorably with radiologist estimates of percentage
density and is more reproducible than radiologist estimates when qualitative
BI-RADS density categories are used. Qualitative and quantitative BI-RADS
density assessments differed markedly.
Previous breast density assessments
performed by using the
Breast Imaging Reporting and
Data System (BI-RADS) have been completely
qualitative. The new (fourth edition)
BI-RADS involves combined qualitative
and quantitative assessments (1).
The quantitative assessments are divided
into quartiles, with category 1 indicating
breast tissue that is less than
25% glandular; category 2, breast tissue
that is approximately 25%-50% glandular;
category 3, breast tissue that is
approximately 51%-75% glandular; and
category 4, breast tissue that is more
than 75% glandular (1). The qualitative
descriptive terms remain the same (1).
The correlation between the new quantitative
assessments and the conventional
qualitative density assessments
has not been well studied. The main
purpose of using the BI-RADS density
categories is to indicate the relative
sensitivity of the mammographic examination
in the detection of breast carcinoma,
which may be lower in cases of
dense breasts (1). Quantitative assessment
of breast density may enable a
more precise determination of differences
in breast density.

Mammographic density is important
for two main reasons: First, the sensitivity
of mammography in the detection
of breast carcinoma is lower in dense
breasts because dense fibroglandular
tissue may obscure calcifications and
masses (1-3). Second, there is a direct
association between increased mammographic
density and increased risk of
developing breast cancer (4-10). In addition,
investigators who use quantitative
assessment of mammographic density
report higher odds ratios for the
development of breast carcinoma in
women with dense breasts compared
with the odds ratios reported by investigators
who use subjective assessment of
density (7,8,11). Boyd et al (6) confirmed
the importance of using precise
methods to determine mammographic
density: They observed a 2% increase in
the relative risk of breast cancer for
every 1% increase in mammographic
density percentage.

There is also evidence that hormonal
therapies, including estrogen and
tamoxifen treatments, can change mammographic
density (9,12-14) and alter
the risk of breast carcinoma (13-18).
Whether this relationship is causal remains
to be proved. A simple and accurate
method of measuring breast density
would be a useful tool for investigating
the relationships between breast cancer
risk and mammographic density.

Several methods to objectively
quantitate mammographic density exist.
The original method, described by
Wolfe et al (11) in 1987, involved the
use of manual planimetry to compute
the density percentage: The dense
white areas on mammograms were
manually traced. However, as the authors
themselves noted, this method
was "tedious and time-consuming." More
recent techniques have been facilitated by
the advent of digital methods of acquiring
and viewing mammographic data. Although
these methods involve the use of
computers, some of them are only partially
automated (19,20). One such
method was based on an ordinal ranking
system rather than on a density percentage
system (20). More recent computerized
programs have been fully automated
(10,21,22).

We developed a method in which a
fully automated mammographic density
estimation (MDEST) program is used to
rapidly determine the perimeter of the
breast and quantitate the mammographic
density percentage (23). Thus, the purpose
of our study was to retrospectively
compare mammographic densities determined
by using this MDEST program
with both radiologists' estimates of density
percentage and BI-RADS breast density
categories.


Materials  and  Methods 


Mammogram  Selection  and  Digitization 

The authors had control of the data and
the information submitted for publication.
The data set comprised the four-view
craniocaudal (CC) and mediolateral
oblique (MLO) mammograms
obtained in 65 patients who were randomly
selected from a National Cancer
Institute-designated comprehensive cancer
center (University of Michigan Health
Center) after institutional review board
approval was obtained. This is the same
mammogram data set used in a prior
study (23). The requirement for individual
patient informed consent was waived.
Our  study  was  HIPAA  compliant.  We 
originally selected 50 consecutive normal
four-view screening mammograms for
analysis. Four months later, to include
more qualitative BI-RADS density category
3 and 4 mammograms (these
were underrepresented in the original
sample), we selected an additional 15
consecutive normal four-view mammograms
that had been qualitatively determined
to be dense. The ages of the 65
women ranged from 24 to 89 years
(mean, 53 years).

The mammograms had been acquired
by using Mammography Quality
Standards Act (MQSA)-approved GE
DMR mammography units (GE Medical
Systems, Milwaukee, Wis) with Kodak
MR2000 (Kodak, Rochester, NY) screen
and film systems. All images were digitized
by using a LUMISYS 85 laser film
scanner (Lumisys, Mountain View, Calif)
with a pixel size of 0.05 X 0.05 mm and
4096 gray levels. The gray levels were
linearly proportional to the optical densities,
from 0.1 to approximately 4.0 optical
density units. The nominal optical density
range of the scanner is 0-4, with large
pixel values corresponding to low optical
density. Since the breast density pattern
does not have to be analyzed in high spatial
resolution (ie, pixel size of 0.05 mm or
less), the full-spatial-resolution mammograms
were first smoothed with a 16 X 16
box filter and subsampled by a factor of
16 to result in 0.8-mm pixel size images
that were approximately 256 X 256 pixels
in size for the analysis. This process reduced
the processing time and image
noise. The technical details are described
elsewhere (23). However, a different
software version of the density program
was used for this study.
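
The smoothing and subsampling step can be sketched as follows. This is a
minimal illustration, not the MDEST code: it assumes that the 16 X 16 box
filter and the factor-of-16 subsampling are aligned, so that the two steps
reduce to averaging non-overlapping 16 X 16 blocks. The function name
`box_downsample` and the use of NumPy are choices made for this sketch only.

```python
import numpy as np

def box_downsample(image, factor=16):
    """Average non-overlapping factor x factor blocks.

    Assumption: box filtering followed by aligned subsampling is
    equivalent to block averaging. Edge rows and columns that do not
    fill a complete block are trimmed.
    """
    image = np.asarray(image, dtype=np.float64)
    rows = (image.shape[0] // factor) * factor
    cols = (image.shape[1] // factor) * factor
    trimmed = image[:rows, :cols]
    # Split each axis into (number of blocks, block size), then average
    # over the two block-size axes.
    blocks = trimmed.reshape(rows // factor, factor, cols // factor, factor)
    return blocks.mean(axis=(1, 3))

# A 0.05-mm pixel averaged over 16 pixels per side gives 0.8-mm pixels.
output_pixel_mm = 0.05 * 16
```

Averaging also reduces uncorrelated pixel noise, which is consistent with
the noise reduction mentioned in the text.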

Mammogram  Density  Analysis  with 
MDEST 

The computer first tracked the breast
boundary by using a gradient-based
edge-tracking algorithm, which has
been described previously (23). The
tracking of the boundaries of a given
breast started from approximately the
middle of the breast image and continued
both upward and downward along
the boundary. The direction in which to
search for a new edge point was guided
by the previous edge points. The edge
location was determined by using a gradient
criterion along a band of pixels
perpendicular to the tracking direction.
The detected boundary separated the
breast from other background features,
including the directly exposed area, patient
identification information, and
lead markers, which were excluded in
the subsequent analyses. Figure 1 shows
examples of the breast boundaries determined
on typical CC- and MLO-view
mammograms. A separate edge-tracking
algorithm was used to detect the edge of
the pectoral muscle on the MLO-view
mammograms (Fig 1). The detected edge
usually is not very smooth owing to noise
on the image. A second-order polynomial
was fitted to the detected edge points to
segment the pectoral region. The pectoral
muscle on the MLO views was excluded
from the subsequent gray-level histogram
analyses and breast area calculations.
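
The polynomial smoothing of the pectoral edge can be illustrated with a
short sketch. It assumes the detected edge is available as (row, column)
points and that the pectoral muscle lies on the low-column side of the
fitted curve; both the data layout and the helper names
(`fit_pectoral_edge`, `pectoral_mask`) are hypothetical and are not taken
from the MDEST program.

```python
import numpy as np

def fit_pectoral_edge(edge_rows, edge_cols):
    """Fit column = a*row**2 + b*row + c to noisy edge points."""
    coeffs = np.polyfit(edge_rows, edge_cols, deg=2)
    return np.poly1d(coeffs)

def pectoral_mask(shape, edge_model):
    """Boolean mask that is True for pixels on the pectoral side.

    Assumption: the muscle occupies columns smaller than the fitted
    boundary column in each row.
    """
    rows = np.arange(shape[0])
    boundary = edge_model(rows)          # fitted boundary column per row
    cols = np.arange(shape[1])
    return cols[np.newaxis, :] < boundary[:, np.newaxis]
```

Pixels flagged by such a mask would then be left out of both the histogram
and the breast area count.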

A dynamic range-compression method
was used to reduce the gray-level
range of the histograms without affecting
the relative areas of the dense tissue
region and the entire breast region.
The histogram of the breast region on
the dynamic-range-compressed mammogram
was generated, normalized, and
smoothed. The histograms were analyzed
by the computer to formulate an automatic
thresholding routine.

After histogram classification, a
gray-level threshold was automatically
calculated to separate the fat and dense
glandular tissue regions. The gray-level
threshold depends on the shape (or
class) of the histogram. If the histogram
has a single peak, the maximum entropy
principle-based method (24) is used to
calculate the threshold. If the histogram
has more than one peak, the discriminant
analysis method (25) is used. The
threshold is used to separate the pixels
in the breast region into two classes:
The class of pixel values above the
threshold corresponds to dense tissue,
and the class of pixel values below the
threshold corresponds to fat tissue.
This classification is represented on a
binary image (ie, segmented image), on
which dense pixels are represented by
white and fat pixels are represented by
black (Fig 2). The percent breast density
is then calculated as the number of
pixels in the dense area divided by the
total number of pixels in the entire
breast region.
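
The thresholding and percent density calculation for a histogram with more
than one peak can be sketched as below. This is an illustration under
stated assumptions, not the MDEST implementation: Otsu's between-class
variance criterion is used as one well-known discriminant-analysis
threshold, and whether it matches reference 25 in every detail is an
assumption. The histogram classification step and the maximum
entropy branch are omitted, and the function names are hypothetical.

```python
import numpy as np

def otsu_threshold(values, bins=256):
    """Discriminant-analysis (Otsu-style) threshold from a histogram."""
    hist, edges = np.histogram(values, bins=bins)
    prob = hist / hist.sum()
    centers = (edges[:-1] + edges[1:]) / 2.0
    w0 = np.cumsum(prob)               # weight of the lower class per cut
    w1 = 1.0 - w0                      # weight of the upper class
    mu = np.cumsum(prob * centers)     # cumulative first moment
    mu_total = mu[-1]
    between = np.zeros_like(w0)
    valid = (w0 > 0) & (w1 > 0)        # skip cuts that leave a class empty
    between[valid] = (
        (mu_total * w0[valid] - mu[valid]) ** 2 / (w0[valid] * w1[valid])
    )
    k = int(np.argmax(between))        # cut with maximal class separation
    return edges[k + 1]                # upper edge of the last "fat" bin

def percent_density(image, breast_mask, threshold):
    """Dense pixels divided by all pixels in the breast region, in %."""
    breast_pixels = np.asarray(image)[np.asarray(breast_mask, dtype=bool)]
    dense = np.count_nonzero(breast_pixels > threshold)
    return 100.0 * dense / breast_pixels.size
```

Pixels above the threshold are counted as dense because, as stated above,
large pixel values correspond to low optical density, that is, to the
white areas on the film.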

Mammogram  Density  Analysis  Performed 
by  Readers 

Seven MQSA-certified radiologists
(K.E.M., M.A.H., M.A.R., J.E.B., C.P.,
C.E.B., K.A.K.) independently evaluated
the data set. Five of these radiologists
(K.E.M., M.A.H., M.A.R., J.E.B.,
C.P.) were involved in a prior study
(23). Before evaluation of the actual
data set, a set of 25 training cases was
used to familiarize five of the radiologists
with the MDEST program and assist
them in the visual estimation of the
percentage of dense area on the mammograms.
Thus, five radiologists were
considered to be trained, and two were
not. The experience of the radiologists
in interpreting mammograms ranged
from 1 to 25 years (median, 9 years;
mean, 10.4 years). The seven radiologists
received their residency education
at six different institutions. Two were
educated at the same institution.

Figure 1: Mammogram from our image database superimposed with the detected breast boundary. (a) CC
view, (b) MLO view with anterior breast boundary and pectoral muscle boundary.

A graphical interface for displaying
and recording the radiologists' evaluations
was developed. For a given breast,
the CC- and MLO-view mammograms
were first displayed side-by-side on a
high-spatial-resolution 22-inch Compaq
AlphaStation monitor (Compaq, Palo
Alto, Calif). This monitor has a display
matrix size of 1280 X 1024 pixels. It is
not Digital Imaging and Communications
in Medicine calibrated, but it allows
one to adjust contrast and brightness
settings, and we adjusted these
at the beginning of the study according
to the subjective impressions of an experienced
MQSA-certified radiologist
(M.A.H.). For each mammogram, the
radiologists were able to adjust the
window and level settings on the display
screen.

Qualitative BI-RADS density classifications.
The radiologist first assigned
each two-view mammogram to one of
the four conventional BI-RADS qualitative
density categories (eg, category 1,
indicating fat tissue). This BI-RADS
density assessment system does not include
any quantitative classification used
in the new (fourth edition) American College
of Radiology BI-RADS (1), which was
not published at the time of the study.
Herein, the scores used in the new
BI-RADS classification system are referred
to as "qualitative BI-RADS categories."

Quantitative estimate of density
percentage. Next, the radiologist visually
estimated the density percentage on
each mammogram by selecting one of
the 10% density ranges displayed on the
screen. Ten density percentage increments
(eg, 1%-10%, 10%-20%) were
used because we believed that it would
be too difficult for the radiologists to
visually estimate density to the nearest
1%.

Determination of reference-standard
density. After the subjective radiologist
evaluation, each view (CC or
MLO) was displayed sequentially. The
displayed material included the original
mammogram, the enhanced mammogram,
the histogram of the breast region
in that view, and the corresponding
binary image created by thresholding
the histogram. The enhanced image was
generated by the MDEST program during
the density segmentation. This image
was basically a version of the original
mammogram with the contrast of
structures enhanced. The radiologist
was then able to manipulate the gray-level
threshold by interactively moving a
slider along the horizontal axis of the
histogram. The binary image changed
simultaneously with the chosen threshold
so that the radiologist could determine
whether the segmented white area
corresponded to the dense white area
on the mammogram. The radiologist
was instructed to change the amount of
segmented dense area to resemble the
area that he or she would trace by hand
if he or she were performing manual
planimetry (11). When the radiologist
determined that the segmented area
was accurate, he or she clicked a button
to record the gray-level threshold and
density percentage for each image.
Since no reference standard exists for
breast density measurements, we used
this value (averaged for five radiologists
previously trained with the training
cases) as the reference-standard density
percentage for each view.

Figure 2: Four representative CC mammograms with corresponding segmented images, density
estimates, and BI-RADS categories determined by trained radiologists. Note the difference between the
quantitative and qualitative BI-RADS categories. Although the two middle (second and third images from top)
mammograms have similar amounts of segmented breast density, the density percentage is greater in the smaller
breast owing to less fat tissue.

The radiologist was blinded to his or
her own estimated density percentage
value obtained and thus could not attempt
to match his or her density percentage
estimate for the different views
or for different breasts of the same patient.
The mammogram of the contralateral
breast of the same patient was
then displayed and evaluated in the
same way. The entire process was repeated
for each patient until the imaging
data of all patients in the data set were
evaluated. We also recorded how long it
took the radiologists to complete their
evaluations of the mammograms.

During the training session for the
five radiologists, both the percentage of
dense area derived by the MDEST program
and that determined by using interactive
thresholding were presented to the
radiologists so that they could compare
these two percentages with their visually
estimated density percentage for each image.
The percent dense areas derived by
using MDEST and interactive thresholding
were not displayed during the actual
study. To assess the effect of training, two
additional breast imaging radiologists,
who had not undergone training to visually
estimate density percentage with the
25 training cases, evaluated the same set
of study images.

Table 1

Descriptive Statistics for Breast Density on CC Views, Estimated by Seven Radiologists

Datum*                              Mean Value†    Minimal Value   Maximal Value

Reader 1
  BI-RADS category                  2.38 ± 0.85    1               4
  MDEST density (%)                 0.25 ± 0.15    0.07            0.76
  Radiologist density estimate (%)  0.30 ± 0.25    0.05            0.85
  Reference-standard density (%)    0.22 ± 0.16    0.02            0.59
Reader 2
  BI-RADS category                  2.44 ± 0.90    1               4
  MDEST density (%)                 0.25 ± 0.15    0.07            0.76
  Radiologist density estimate (%)  0.23 ± 0.21    0.05            0.85
  Reference-standard density (%)    0.23 ± 0.18    0.02            0.67
Reader 3
  BI-RADS category                  2.65 ± 0.88    1               4
  MDEST density (%)                 0.25 ± 0.15    0.07            0.76
  Radiologist density estimate (%)  0.27 ± 0.19    0.05            0.65
  Reference-standard density (%)    0.25 ± 0.19    0.02            0.72
Reader 4
  BI-RADS category                  2.52 ± 1.08    1               4
  MDEST density (%)                 0.25 ± 0.15    0.07            0.76
  Radiologist density estimate (%)  0.34 ± 0.25    0.05            0.85
  Reference-standard density (%)    0.27 ± 0.20    0.01            0.72
Reader 5
  BI-RADS category                  2.52 ± 1.02    1               4
  MDEST density (%)                 0.25 ± 0.15    0.07            0.76
  Radiologist density estimate (%)  0.31 ± 0.26    0.05            0.85
  Reference-standard density (%)    0.25 ± 0.18    0.02            0.73
Reader 6
  BI-RADS category                  2.5 ± 0.90     1               4
  MDEST density (%)                 0.25 ± 0.15    0.07            0.76
  Radiologist density estimate (%)  0.42 ± 0.26    0.05            0.95
Reader 7
  BI-RADS category                  2.32 ± 1.06    1               4
  MDEST density (%)                 0.25 ± 0.15    0.07            0.76
  Radiologist density estimate (%)  0.44 ± 0.27    0.05            0.95

* Readers 1-5 were trained in using the MDEST method, and readers 6 and 7 were not. Radiologist estimates of breast density
were based on visual estimates of density percentage.

† Mean values ± standard deviations.

Statistical Analyses

Pearson correlation coefficients were
calculated to examine the associations
of the qualitative BI-RADS-, MDEST-,
and trained radiologist-estimated mammographic
densities with the true (ie, reference-standard)
mammographic density.
To assess the agreement between
the reference-standard density and both
the MDEST- and the trained radiologist-estimated
densities and to obtain 95%
limits of agreement, the method of Bland
and Altman was used (26). Interreader
agreement among the radiologists was
measured by using κ statistics (27). The
strengths of agreement were expressed in
κ values: A value of 0.20 or less indicated
poor; 0.21-0.40, fair; 0.41-0.60, moderate;
0.61-0.80, good; and 0.81-1.00,
very good agreement. The significance of
differences in overestimations of density
between the trained and untrained radiologists
was estimated by using the unpaired
t test. For the statistical calculations,
the radiologists' density percentage
estimates were expressed as the mean of
the 10% range (eg, for 1%-10%, 5% was
used). Software, including SAS (SAS Institute,
Cary, NC) and Microsoft Excel
(Redmond, Wash), was used to perform
all statistical analyses.
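
The Bland-Altman agreement calculation can be sketched as follows. This
is an illustration only, not the SAS or Excel procedure used in the study.
Two choices here are inferences rather than statements from the text: the
difference is taken as reference minus estimate, which matches the
negative mean bias values in Table 4, and the limits are placed at the
mean difference ± 2 standard deviations, which reproduces the Table 4
limits (eg, -0.0036 ± 2 × 0.0797 gives -0.1630 and 0.1558). A multiplier
of 1.96 is also in common use. The function name is hypothetical.

```python
import numpy as np

def bland_altman(reference, estimate, multiplier=2.0):
    """Return (mean bias, lower limit, upper limit) of agreement.

    Differences are computed as reference - estimate (assumed sign
    convention); limits are bias -/+ multiplier * sample SD.
    """
    diff = np.asarray(reference, float) - np.asarray(estimate, float)
    bias = diff.mean()
    sd = diff.std(ddof=1)              # sample standard deviation
    return bias, bias - multiplier * sd, bias + multiplier * sd
```

With this sign convention, a negative bias means that the method under
test overestimates density relative to the reference standard.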


Results 


We excluded 16 (6%) of the 260 mammographic
views owing to technical problems
that were secondary to improper
breast boundary detection or the MDEST
program's gross misclassification of the
gray-level histograms. The MDEST program
performed well in most cases. Descriptive
statistics for the four methods of
evaluating mammographic density are
presented, according to radiologist and
view, in Tables 1 and 2.

[Radiology: Volume 240: Number 3, September 2006. Breast Imaging: Computer-aided Measurement of Mammographic Density. Martin et al.]

Density  Analyses 

Pearson correlation coefficients for correlations
between the reference-standard
density and the qualitative BI-RADS-,
MDEST-, and trained radiologist-estimated
densities showed strong
positive linear relationships (Table 3).
These positive correlations indicate that
as the reference-standard density percentage
increased, the values obtained
with the other methods also tended to
increase. Of these three estimated densities,
the trained radiologists' estimates
had the highest correlation with the
reference-standard density (R = 0.90), the
MDEST measurements had the second
highest correlation (R = 0.89), and the
qualitative BI-RADS categories had the
third highest correlation (R = 0.83).
The correlation of each method with the
reference-standard method was better
on the CC views than on the MLO views.
Correlation coefficients for agreement
between the CC and MLO views of the
same breast were 0.85 with MDEST;
0.96 with the reference-standard
method; and 0.94, 0.95, 0.94, 0.93, and
0.94 with the estimates of the five trained
radiologists.


Agreement of Reference-Standard Density
With MDEST- and Trained Radiologist-estimated Densities

The method of Bland and Altman (26)
was used to assess agreement between
the reference-standard density and both
the MDEST- and the trained radiologist-estimated
densities. The mean
bias for the comparison between the
trained radiologists' density estimates
and the reference-standard measurement
was an overestimation of 6%, compared
with an overestimation of 1% by the
MDEST program (Table 4). The 95% limits
of agreement between the trained
radiologist-estimated and reference-standard
densities were wider (-16%
to +27%), indicating greater error in
measuring density by using radiologist
estimates than by using MDEST. The
overall limits of agreement between the
reference-standard and MDEST densities
ranged from -15% to +18%. The
MDEST program tended to overestimate
mammographic density (up to
+18%) more than it underestimated it
(up to -15%).

Untrained Radiologists

The densities estimated by the two untrained
breast imagers had excellent
correlation with the reference-standard
measurement (R = .95). The untrained
radiologists overestimated the mammographic
density percentage to a greater
extent than did the trained radiologists.
The untrained radiologists overestimated
density by 37% with respect to
the reference-standard density (Fig 3).
This was in contrast to the 6% overestimation
of the trained radiologists (P <
.001). There was no significant difference
in the assignment of qualitative
BI-RADS density categories between the
trained and untrained radiologists (P >
.43).

Table 2

Descriptive Statistics for Breast Density on MLO Views, Estimated by Seven Radiologists

Datum*                              Mean Value†    Minimal Value   Maximal Value

Reader 1
  BI-RADS category                  2.37 ± 0.87    1               4
  MDEST density (%)                 0.25 ± 0.17    0.06            0.82
  Radiologist density estimate (%)  0.31 ± 0.26    0.05            0.95
  Reference-standard density (%)    0.20 ± 0.16    0.01            0.74
Reader 2
  BI-RADS category                  2.42 ± 0.91    1               4
  MDEST density (%)                 0.25 ± 0.17    0.06            0.82
  Radiologist density estimate (%)  0.23 ± 0.21    0.05            0.85
  Reference-standard density (%)    0.22 ± 0.17    0.03            0.77
Reader 3
  BI-RADS category                  2.65 ± 0.87    1               4
  MDEST density (%)                 0.25 ± 0.17    0.06            0.82
  Radiologist density estimate (%)  0.27 ± 0.19    0.05            0.75
  Reference-standard density (%)    0.24 ± 0.18    0.02            0.76
Reader 4
  BI-RADS category                  2.51 ± 1.07    1               4
  MDEST density (%)                 0.25 ± 0.17    0.06            0.82
  Radiologist density estimate (%)  0.34 ± 0.25    0.05            0.85
  Reference-standard density (%)    0.26 ± 0.2     0.01            0.82
Reader 5
  BI-RADS category                  2.52 ± 1.03    1               4
  MDEST density (%)                 0.25 ± 0.17    0.06            0.82
  Radiologist density estimate (%)  0.31 ± 0.26    0.05            0.85
  Reference-standard density (%)    0.23 ± 0.17    0.02            0.8
Reader 6
  BI-RADS category                  2.5 ± 0.90     1               4
  MDEST density (%)                 0.25 ± 0.17    0.06            0.82
  Radiologist density estimate (%)  0.42 ± 0.26    0.05            0.95
Reader 7
  BI-RADS category                  2.32 ± 1.06    1               4
  MDEST density (%)                 0.25 ± 0.17    0.06            0.82
  Radiologist density estimate (%)  0.44 ± 0.27    0.05            0.95

* Readers 1-5 were trained in using the MDEST method, and readers 6 and 7 were not. Radiologist estimates of breast density
were based on visual estimates of density percentage.

† Mean values ± standard deviations.

Table 4

Mean Agreement between Density Measurement Methods

                                         95% Limits of Agreement
Correlation Method   Mean Bias*          Lower       Upper

Reference-Standard versus MDEST Density
  By view
    CC               -0.0036 ± 0.0797    -0.1630     0.1558
    MLO              -0.0244 ± 0.0824    -0.1892     0.1404
  Overall            -0.01384 ± 0.0816   -0.1770     0.1494

Reference-Standard Density versus Trained Radiologists' Estimates
  By view
    CC               -0.0469 ± 0.1029    -0.2527     0.1589
    MLO              -0.0634 ± 0.1117    -0.2868     0.1600
  Overall            -0.0551 ± 0.1076    -0.2703     0.1601

* Mean bias values ± standard deviations.

Comparison  with  BI-RADS 

A wide range of percent densities were
assigned to mammograms classified in
three of the four qualitative BI-RADS
categories (BI-RADS 2-4) (Table 5).
For example, qualitative BI-RADS category
4 included mammograms with densities
of between 20% and 82%. There
was also a large range of qualitative
BI-RADS categories assigned to the same
range of percent densities. A mammogram
with 0%-24% density may have
been assigned to qualitative BI-RADS
category 1, 2, or 3 (Table 6) (Fig 2).

In Table 6, the new quantitative
BI-RADS density quartiles (0%-24%, 25%-49%,
50%-74%, and 75%-100%) are
compared with the conventional qualitative
BI-RADS categories. The reference-standard
percent densities are grouped to
mimic the new quantitative BI-RADS
breast density quartiles. There was poor
agreement between the qualitative
BI-RADS density classifications and the new
quantitative BI-RADS density classifications,
with use of the reference-standard
density as truth. If the new combined
qualitative and quantitative BI-RADS category
4, indicating extremely dense
breast tissue, corresponded to greater
than 75% breast density, far fewer cases
would be classified as BI-RADS 4 compared
with the number of cases that
would be classified as qualitative BI-RADS
4. With use of the qualitative BI-RADS
system, 110 (17%) of the 650 cases (130
breasts times five radiologists equals 650
cases) in our study were assigned to category
4. With use of the new quantitative
system, none of the 650 cases was assigned
to category 4. With use of the old
qualitative BI-RADS system, 108 (17%)
of the 650 cases were classified as fatty
(category 1); with use of the new
BI-RADS quantitative system, 370 (57%)
cases were classified as fatty (category 1),
representing a downstaging of 40%.
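
The grouping of percent density into quantitative BI-RADS categories can
be written as a small lookup. The sketch below assumes the quartile
boundaries used for Table 6 (0%-24%, 25%-49%, 50%-74%, 75%-100%); the
function name is hypothetical, and the handling of values that fall
exactly on a boundary is a choice made for this illustration.

```python
def quantitative_birads(percent_density):
    """Map a percent density (0-100) to a quantitative BI-RADS category."""
    if not 0 <= percent_density <= 100:
        raise ValueError("percent density must be between 0 and 100")
    if percent_density < 25:
        return 1    # 0%-24%
    if percent_density < 50:
        return 2    # 25%-49%
    if percent_density < 75:
        return 3    # 50%-74%
    return 4        # 75%-100%

# Share of the 650 cases classified as fatty under each system,
# using the counts reported in the text.
qualitative_fatty = 100 * 108 / 650     # about 17%
quantitative_fatty = 100 * 370 / 650    # about 57%
```

The difference between the two shares, about 40 percentage points, is the
downstaging reported above.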

Interobserver  Agreement 

The interobserver agreement values observed
for each density measurement
method indicate that there was strong
agreement among the trained radiologists.
Intraclass correlation coefficients
were 0.88 for the radiologists' estimates
of density percentage and 0.94 for the
radiologists' determinations of the
reference-standard density percentage, indicating
very good agreement among
the radiologists' density measurements
obtained with these two methods. Pairwise
comparison of the radiologists' assignments
of qualitative BI-RADS categories
revealed good but lower agreement,
with κ values (27) ranging from
0.61 to 0.76.

Time 

The mean time to complete the qualitative
BI-RADS density category assignments
and density percentage estimations
with the MDEST program
was 18 seconds per view (range,
13-22 seconds), with a mean standard
deviation of 8 seconds (range, 7-9 seconds).


Discussion 


Our study findings demonstrate that
quantified breast density is a more accurate
and reproducible measure of
breast density than radiologist estimates
derived by using the conventional
qualitative BI-RADS density categories.
In our study, we observed large ranges
of percent densities among cases classified
in three of the four qualitative
BI-RADS density categories, with up to a
62% range in the category 4 cases. We
also observed a large overlap in the
mammographic density percentages assigned
to cases classified in the four
qualitative BI-RADS categories by five
breast imaging specialists, with up to a
40% range overlap between categories
3 and 4. A mammogram with 30% total
breast density could be assigned to qualitative
BI-RADS category 2, 3, or 4;
however, under the new system, it
would be assigned to category 2.

Similar results were found in a comparison between the qualitative
estimates based on the Wolfe parenchymal patterns and the quantitative
determinations of density made by using manual planimetry (28). This is
not surprising, given the subjective nature of both the qualitative
BI-RADS density categories and the Wolfe parenchymal patterns. More
recently, Wang et al (21) suggested that the visual density percentage
estimates derived by three mammographers may have led to the same
mammogram being assigned to different qualitative BI-RADS categories.
With use of quantitative percent breast density determinations, one is
more likely to detect subtle changes in breast density that may be
masked when they are classified in the same BI-RADS category as the
overall breast density.


Table 5

Range of Mammographic Density Percentages Assigned to Qualitative
BI-RADS Categories by Trained Radiologists

Qualitative BI-RADS Category    Density Percentage Range (%)
1                               1-11
2                               2-45
3                               8-60
4                               20-82

Our results show that experienced radiologists’ subjective density
assessments based on qualitative BI-RADS categories may be quite
different from density assessments based on quantitative BI-RADS
categories. For example, none of the 650 mammographic cases judged to
have qualitative category 4 density had greater than 75% breast density
according to quantitative BI-RADS measures. In addition, the cases with
0%-24% quantitative BI-RADS category 1 (fatty) breast density would have
encompassed a majority (370 [57%] of 650) of the mammograms, many of
which were conventionally assigned to qualitative BI-RADS category
2 or 3.


Figure 3: Graph illustrates mammographic density percentage visually
estimated by two untrained radiologists versus reference-standard
density percentage. The untrained radiologists significantly
overestimated the percent breast density relative to the
reference-standard density determined by the trained radiologists
(P < .001).


Table 6

Qualitative BI-RADS versus Quantitative Density Determinations

Quantitative Density     Qualitative BI-RADS Score*
Percentage†              1              2              3              4
0-24                     17 (108/650)   29 (189/650)   11 (73/650)    0
25-49                    0              3 (20/650)     21 (136/650)   6 (39/650)
50-74                    0              0              2 (14/650)     11 (71/650)
75-100                   0              0              0              0

* Data are percentages of cases with the given quantitative density
percentage that were assigned the given BI-RADS score. The numbers of
cases used to calculate the percentages are in parentheses.

† The quantitative (ie, reference-standard) breast density ranges
correspond to the new quantitative BI-RADS density categories. Each
breast (depicted on CC and MLO views) was assigned a single qualitative
BI-RADS score by five trained radiologists (130 breasts times five
radiologists equals 650 cases).
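
The category counts quoted in the text can be re-derived from Table 6. The sketch below maps a density percentage to a quantitative BI-RADS category and tallies the table counts. The function name is hypothetical, and the handling of non-integer percentages between the published ranges (for example, 24.5%) is an assumption, since the ranges are stated only as 0-24, 25-49, 50-74, and 75-100.

```python
def quantitative_birads_category(percent_density):
    """Map a density percentage to quantitative BI-RADS category 1-4.

    Assumption: values between published ranges fall in the lower category.
    """
    if not 0 <= percent_density <= 100:
        raise ValueError("percent density must be between 0 and 100")
    if percent_density < 25:
        return 1
    if percent_density < 50:
        return 2
    if percent_density < 75:
        return 3
    return 4


# Case counts from Table 6: outer key = quantitative category,
# inner key = qualitative BI-RADS score assigned by the radiologists.
TABLE_6_COUNTS = {
    1: {1: 108, 2: 189, 3: 73, 4: 0},   # 0-24%
    2: {1: 0, 2: 20, 3: 136, 4: 39},    # 25-49%
    3: {1: 0, 2: 0, 3: 14, 4: 71},      # 50-74%
    4: {1: 0, 2: 0, 3: 0, 4: 0},        # 75-100%
}

total_cases = sum(sum(row.values()) for row in TABLE_6_COUNTS.values())
quantitative_fatty = sum(TABLE_6_COUNTS[1].values())
qualitative_fatty = sum(row[1] for row in TABLE_6_COUNTS.values())
qualitative_category_4 = sum(row[4] for row in TABLE_6_COUNTS.values())

# Share of cases in category 1 under each system, in percent.
quantitative_fatty_pct = round(100 * quantitative_fatty / total_cases)
qualitative_fatty_pct = round(100 * qualitative_fatty / total_cases)
```

The tallies reproduce the 650 total cases, the 370 (57%) quantitative and 108 (17%) qualitative category 1 cases, and the 110 qualitative category 4 cases reported above.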

A goal of the BI-RADS system is to facilitate uniformity of physician
reports, and our results suggest that additional training would be
necessary to enable physicians to accurately translate a visual
assessment of density percentage into a quantitative assessment, as
recommended by the new BI-RADS standards (1). In our study, the two
untrained radiologists overestimated density by 37%. However,
radiologists could be rapidly trained to estimate breast density by
using a computerized density measurement program so that they could
follow the new quantitative BI-RADS density classifications.
Furthermore, use of the new quantitative BI-RADS assessment may lead to
the “down coding” of breast density and thus the creation of
nonuniformity between the old and new standards and the consequent
hindering of longitudinal research. A case previously assigned to
BI-RADS category 3 may now be assigned to BI-RADS category 1. Since very
few breasts have greater than 75% density, the new quantified BI-RADS
may functionally approach a three-density-level system, with the
majority of mammographic cases assigned to categories 1 and 2.

The MDEST breast density determinations were more accurate than the
radiologists’ visual estimates of breast density, as indicated by the
radiologists’ overestimating of breast density to a greater degree and
their larger variation in density estimates (relative to the
reference-standard density) compared with the MDEST measurements. We
also observed good correlations between MDEST-derived density percentage
and radiologist-determined reference-standard density percentage
(R = 0.89). Although this correlation was slightly lower than that
between the trained radiologist-estimated and reference-standard
densities (R = 0.90), the MDEST-derived densities had tighter agreement
with the reference-standard measurements than did the radiologists’
estimates. The MDEST program overestimated density by a mean of only 1%
(range, -15% to +18%), as compared with a mean overestimation of density
of 6% by the trained radiologists (with a wider range: -16% to +27%)
relative to the reference-standard density percentage.

Correlation coefficients for agreement on density measurement between
the CC and MLO views favored the trained radiologists, although both the
MDEST program and the radiologists had good correlation. The MDEST
program performed better than the untrained radiologists in the
estimation of percent breast density. The two untrained breast imagers
tended to overestimate density percentage by approximately 37%, which
was greater than the percentage of overestimation by the trained breast
imagers. These findings are in contrast to those of Lee-Han et al (28):
The single radiologist in their study slightly underestimated the
density percentage relative to the measured area of density. This result
may have been secondary to some form of density percentage estimation
training received by the radiologist, although this was not specified.

Many of the computer programs previously used to evaluate breast density
have been only partially automated. The density measurement methods used
by Byng et al (19) and Boone et al (20) involved manual cropping of the
pectoral muscle to determine the breast area on mammograms. In addition,
the Byng et al method involved manual determinations of both breast edge
and breast density gray-level thresholds. With the Byng et al method, it
took less than a minute to evaluate each image (19). Our fully automated
program automatically detects the breast edge, crops the pectoral
muscle, and estimates the gray-level threshold for density segmentation.
In addition, if manual interactive thresholding (the reference-standard
method used in the current study) is preferred, the MDEST user interface
is fast and simple to use, requiring an average of 18 seconds per view
to evaluate both the BI-RADS category-based density and the density
percentage. Investigators in two other studies (10,21) have described
fully automated programs for determining breast density.

We found that percent breast density determinations were more accurate
on CC views than on MLO views. This was true for both the MDEST
densities and the radiologist visual density estimates. The MDEST
program overestimated density percentage by a mean of 2.4% on the MLO
views and by a mean of 0.4% on the CC views. The radiologists
overestimated density by a mean of 6.3% on the MLO views and by a mean
of 4.7% on the CC views. These data suggest that in the future, CC views
alone may be adequate for assessing percent breast density in temporal
measurements.

There were several limitations to our study. The MDEST program had
technical errors, which led to a 6% case rejection rate. Technical
errors included inaccurate breast border detection and gross
misclassification of the gray-scale histograms. Errors in both the
anterior breast border detection algorithm and the pectoral muscle
detection algorithm occurred and resulted in inaccurate breast tissue
area determinations. Misclassification of the gray-scale histograms
resulted in improper gray-level threshold determination, inaccurate
segmentation of the dense areas, and inaccurate density percentage
calculations. These histogram misclassification errors occurred more
often on the mammograms with extremely dense and fatty pixels. Thus,
MDEST cannot yet be used as a stand-alone density measurement method.
Although further development of computer visualization techniques and
additional training with a large data set are needed to improve the
accuracy and robustness of MDEST, the results of this study demonstrate
the feasibility of our approach and the promise of using an automated or
semiautomatic system like MDEST to aid future research efforts in the
investigation of mammographic breast density.

Another limitation of our study was that the BI-RADS qualitative
assessments were subjective and could be institutionally defined. Six
(86%) of the seven radiologists received residency training at different
institutions, so strong institutional bias was less likely in this
study. Also, there is no reference standard for determining breast
density, so there will always be some subjective difference in
determining mammographic density, even when manual segmentation is used.
The averaging of five radiologists’ segmentations may have partially
reduced this bias.

Our MDEST program calculates the area of mammographic breast density,
which correlates with the area of fibroglandular tissue that is present.
However, volume is a more accurate measure of the amount of breast
tissue than is area. Accurate determination of the dense tissue volume
requires sensitometry and scatter and beam-hardening corrections for
each mammogram. A rough estimate of dense tissue volume could be
determined by multiplying the breast thickness, which is recorded for
each mammogram at our institution, by the area of dense tissue. Wang et
al (21) described a computer-aided detection method that is more
accurate for estimating dense mammographic tissue composition because it
involves the use of a tissue-thickness-correction algorithm. This
concept of breast tissue volume may be of importance in the study of
breast cancer risk, because it is probably the volume of dense glandular
breast tissue, rather than the density of breast tissue, that determines
risk (9). Wei et al (22) recently observed a high correlation between
our automated mammographic MDEST assessment method and volumetric
fibroglandular tissue estimation at breast magnetic resonance imaging,
suggesting that estimates of change in mammographic density are close
surrogates for change in volumetric density. Further investigation is
needed to determine whether rough estimates of dense tissue volume can
improve the correlation between breast density and breast cancer risk.

In conclusion, the MDEST-derived densities compared favorably to
radiologist estimates of percent breast density and were more
reproducible than radiologist estimates of the conventional qualitative
BI-RADS density categories. Qualitative and quantitative BI-RADS

Abstract.  Studies  have  demonstrated  a  strong  correlation  between
mammographic  breast  density  and  breast  cancer  risk.  Mammographic  breast
density  may  therefore  be  used  as  a  surrogate  marker  for  monitoring  the  response
to  treatment  in  studies  of  breast  cancer  prevention  or  intervention  methods.  In
this  study,  we  evaluated  the  accuracy  of  using  mammograms  for  estimating
breast  density  by  analyzing  the  correlation  between  the  percent  mammographic 
dense  area  and  the  percent  glandular  tissue  volume  as  estimated  from  MR 
images.  A  data  set  of  37  patients  who  had  corresponding  MR  images  and 
mammograms  was  collected.  The  glandular  tissue  regions  in  the  MR  slices 
were  segmented  by  a  semi-automatic  method  and  the  percent  glandular  tissue 
volume  calculated.  Mammographic  breast  density  was  estimated  by  an 
automated  image  analysis  program.  It  was  found  that  the  correlation  between 
the  percent  dense  area  of  the  CC  and  MLO  views  and  the  percent  volumetric 
fibroglandular  tissue  on  MR  images  was  0.93  and  0.91,  respectively,  with  a 
mean  bias  of  4.4%.  The  high  correlation  indicates  the  usefulness  of 
mammographic  density  as  a  surrogate  for  breast  density  estimation. 


1.  Introduction 

Previous  studies  have  shown  that  there  is  a  strong  positive  correlation  between 
breast  parenchymal  density  on  mammograms  and  breast  cancer  risk  [1-3].  The 
relative  risk  is  estimated  to  be  about  4  to  6  times  higher  for  women  whose 
mammograms  have  parenchymal  densities  over  60%  of  the  breast  area,  as  compared 
to  women  with  less  than  5%  of  parenchymal  densities.  The  change  in  mammographic 
breast  density  is  therefore  often  used  as  an  indicator  for  monitoring  the  effects  of 
preventive  or  interventional  treatment  of  breast  cancer. 

Breast  cancer  risk  is  expected  to  depend  on  the  volume  of  glandular  tissue  in  the 
breast.  Mammographic  density  is  a  projection  of  the  volume  of  glandular  tissue  onto 
the  two-dimensional  image  plane.  To  better  understand  the  correlation  between 
mammographic  density  and  breast  cancer  risk,  it  is  important  to  investigate  the 
relationship  between  the  projected  areal  density  on  mammograms  and  the  volume  of 
glandular  tissue  in  the  breast.  In  this  study,  we  investigate  this  relationship  by 
analyzing  the  percent  volumetric  glandular  tissue  in  magnetic  resonance  (MR)  images 
and  the  percent  dense  area  in  corresponding  mammograms  for  the  same  breasts. 

2.  Materials  and  Methods

Our data set consisted of corresponding MR breast images and mammograms
from 37 patients acquired between detection and biopsy. The MR image
series used in this study, which included coronal 3D T1-weighted
pre-contrast images (coronal sections 2-5 mm thick, 32 slices; 3D
Spoiled Gradient-Recalled Echo (SPGR); TE=3.3 ms; TR=10 ms; flip
angle=40°; matrix=256x128; FOV=28-32 cm right/left, 14-16 cm
superior/inferior; scan time=2 min 38 sec), was part of a dynamic breast
MRI study. This 3D SPGR sequence produced full volume coverage of both
breasts with contiguous image slices. An example of images from one
breast is shown in Fig. 1. Although this is not the optimal pulse
sequence for separating water and fat, the fibroglandular parenchyma
(~water) and fatty tissue are well separated with this heavily
T1-weighted acquisition and therefore the series was chosen for this
study.

Fig. 1. MR images of the right breast of a patient. The two-view
mammogram of the same breast is shown in Fig. 2.

We have developed a graphical user interface that displays the MR series
and the corresponding mammogram of each breast. The interface allows the
user to perform a combination of manual and automatic operations to
segment the MR images. Each MR slice is first thresholded to separate
the breast from the surrounding region. For slices close to the chest
wall where no clear boundary can be seen, the boundary is manually drawn
and evaluated by radiologists. Background correction [4] using the voxel
values around the segmented breast region is employed to correct for the
non-uniformity across the breast area due to the breast coil. The
histogram of the voxel values in the breast region is then formed and
interactive thresholding is used to segment the fibroglandular tissue
from the fatty tissue. A morphological erosion operation along the
breast boundary then excludes the skin voxels from the calculation of
the fibroglandular tissue area in each slice. Finally, an integration of
the fibroglandular voxels in all slices relative to the breast volume
provides the percent volumetric fibroglandular tissue in the breast.
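
The final integration step can be sketched as follows. This is a simplified illustration, not the interface described above: it assumes that each slice already has a breast mask with the skin voxels excluded, that all voxels have equal volume, and that fibroglandular tissue is darker than fat on the T1-weighted images (exposed as the hypothetical `dense_is_darker` parameter). Background correction and erosion are omitted, and the example voxel values are invented.

```python
def percent_fibroglandular_volume(slices, masks, threshold, dense_is_darker=True):
    """Percent of breast voxels classified as fibroglandular across all slices.

    slices: list of 2D lists of voxel values, one per MR slice.
    masks:  matching 2D lists; truthy where the voxel lies inside the breast.
    """
    breast_voxels = 0
    dense_voxels = 0
    for image, mask in zip(slices, masks):
        for image_row, mask_row in zip(image, mask):
            for value, inside in zip(image_row, mask_row):
                if not inside:
                    continue  # skip background and excluded voxels
                breast_voxels += 1
                if dense_is_darker:
                    is_dense = value < threshold
                else:
                    is_dense = value > threshold
                if is_dense:
                    dense_voxels += 1
    if breast_voxels == 0:
        raise ValueError("breast mask is empty")
    return 100.0 * dense_voxels / breast_voxels


# Two tiny hypothetical slices (values and masks are illustrative only).
example_slices = [
    [[10, 80, 90], [20, 85, 0]],
    [[15, 95, 0], [70, 88, 0]],
]
example_masks = [
    [[1, 1, 1], [1, 1, 0]],
    [[1, 1, 0], [1, 1, 0]],
]
percent_volume = percent_fibroglandular_volume(example_slices, example_masks, 50)
```

Because the slices are contiguous and the voxel size is taken as constant, counting voxels is equivalent to integrating volume; with varying slice thickness, each slice's counts would need to be weighted by its thickness.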

We have previously developed an automated image analysis tool
(Mammographic Density ESTimator, MDEST) to assist radiologists in
estimating mammographic breast density [5]. MDEST performs dynamic range
compression, breast boundary tracking, pectoral muscle segmentation for
the MLO view, automatic thresholding based on gray level histogram
analysis, and calculation of the percent dense area on a mammogram. We
found that the correlation between the computer-estimated percent dense
area and radiologists’ manual segmentation was 0.94 and 0.91,
respectively, for CC and MLO views, with a mean bias of less than 2%. An
example of a mammogram segmented by MDEST is shown in Fig. 2.


Fig. 2. Automated breast density segmentation from mammograms. Upper
row: CC view. Lower row: MLO view.


3.  Results 

Scatter  plots  of  the  percent  volumetric  fibroglandular  tissue  versus  the  percent 
dense  area  on  mammograms  are  shown  in  Fig.  3(a)  and  3(b)  for  the  CC-  and  MLO- 
view  mammograms,  respectively.  The  correlation  of  percent  dense  area  of  the  CC  and 
MLO  views  with  the  percent  volumetric  fibroglandular  tissue  on  MR  images  was 
found  to  be  0.93  and  0.91,  respectively,  with  a  mean  bias  of  4.4%. 
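
The two summary statistics reported here can be computed as in the sketch below. It assumes that the correlation is the Pearson coefficient and that the bias is the mean of the mammographic estimate minus the MR estimate; the source does not state the sign convention. The function names and the example values are hypothetical and are not the study data.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def mean_bias(estimates, reference):
    """Mean signed difference (estimate minus reference)."""
    return sum(e - r for e, r in zip(estimates, reference)) / len(estimates)


# Hypothetical percent values for four breasts (illustrative only).
percent_dense_area = [10, 20, 30, 40]     # from mammograms
percent_mr_volume = [8, 15, 27, 34]       # from MR images
r_value = pearson_r(percent_dense_area, percent_mr_volume)
bias = mean_bias(percent_dense_area, percent_mr_volume)
```

A high correlation with a nonzero bias, as in this toy example, means the two measures track each other closely while differing by a roughly constant offset.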

[Fig. 3 axis labels: % Area-CC View (a); % Area-MLO View (b)]

Fig. 3. Correlation of % volumetric fibroglandular tissue on MR images
with % dense area on mammograms for 37 patients. The left and right
breasts are plotted as separate data points on each graph. The dashed
lines are linear least squares fits to the data points.


4.  Conclusion 

Mammographic  density  is  highly  correlated  with  the  volumetric  fibroglandular 
tissue  in  the  breast,  indicating  its  usefulness  as  a  surrogate  for  breast  density 
estimation.  The  computerized  image  analysis  tool,  MDEST,  is  useful  for  estimation  of 
mammographic  density.  The  automated  analysis  is  expected  to  contribute  to  the 
understanding  of  the  relationship  of  mammographic  density  to  breast  cancer  risk, 
detection,  and  prognosis,  and  to  the  prevention  and  treatment  of  breast  cancer. 

Abstract.  Studies  have  demonstrated  a  strong  correlation  between
mammographic  breast  density  and  breast  cancer  risk.  We  have 
previously  developed  a  computerized  system,  mammographic  density 
estimator  (MDEST),  to  estimate  breast  density  automatically  on 
digitized  film  mammograms  (DFM).  In  this  study,  we  evaluated  the 
performance  of  the  MDEST  system  on  full  field  digital  mammograms 
(FFDM)  and  DFMs.  The  input  to  the  system  is  a  preprocessed  dynamic 
range  compressed  image.  The  breast  region  is  first  segmented  by  breast 
boundary  detection.  The  pectoral  muscle  is  trimmed  if  it  is  an  MLO
view.  A  rule-based  classifier  is  then  used  to  classify  the  breast  image 
into  one  of  four  classes  according  to  the  characteristics  of  its  gray  level 
histogram.  The  dense  area  from  the  breast  region  is  subsequently 
segmented  by  automatic  gray  level  thresholding.  The  breast  density  is 
estimated  as  the  percentage  of  the  segmented  dense  area  relative  to  the 
breast  area.  In  this  study,  two-view  FFDM  and  the  corresponding  DFM 
from  99  patients  with  202  images  in  each  set  were  used.  The  dense  area 
on  each  mammogram  was  segmented  by  4  radiologists  using  interactive 
thresholding  and  their  average  was  used  as  the  “gold  standard”.  The 
MDEST  system  was  directly  applied  to  the  FFDM  and  DFM  data 
without  any  re-training  except  that  the  preprocessing  filter  was  modified 
for  FFDMs.  We  found  that  the  correlation  between  the  estimated  percent 
dense  area  and  the  gold  standard  was  0.850  and  0.873  on  FFDM,  and 
0.885  and  0.824  on  DFM,  for  CC  and  MLO  views,  respectively.  The
results  demonstrated  the  feasibility  of  estimating  breast  density 
automatically  on  FFDM  and  DFM  using  the  same  MDEST  system. 


1.  Introduction 

Studies  have  demonstrated  a  strong  correlation  between  breast  density  on  mammograms  and 
breast  cancer  risk  (Saftlas  and  Szklo  1987;  Brisson  et  al.  1989;  Saftlas  et  al.  1991;  Oza  and 
Boyd  1993;  Boyd  et  al.  1998;  Yaffe  et  al.  1998).  The  relative  risk  is  estimated  to  be  about  4 
to  6  times  higher  for  women  whose  mammograms  have  parenchymal  densities  over  60%  of 
the  breast  area,  as  compared  to  women  with  less  than  5%  of  parenchymal  densities.  The 
strong  correlation  between  breast  density  and  breast  cancer  risk  has  prompted  researchers  to 
use  mammographic  density  as  an  indicator  for  monitoring  the  effects  of  preventive  or 
interventional  treatment  of  breast  cancer. 

Because of the subjective nature of visual analysis, qualitative
estimation may vary from radiologist to radiologist. A computerized
method for measuring mammographic density would be useful as a
supplement to the radiologist’s assessment. We have previously developed
a computerized system, mammographic density estimator (MDEST) (Zhou et
al. 2001), to estimate breast density automatically on digitized film
mammograms (DFM). The MDEST system performs dynamic range compression,
breast boundary tracking, pectoral muscle trimming for MLO view, gray
level thresholding based on histogram analysis, and calculation of the
percent dense area on the mammogram. In the previous study, 260
digitized 4-view mammograms of 65 patients were used. The gold standard
of percent dense area of the breast region for each mammogram was
obtained by averaging five radiologists’ manually segmented percent
dense area. We found that the correlation between the computer-estimated
percent dense area and radiologists’ manual segmentation was 0.94 and
0.91, with RMS errors at 6.1% and 7.2%, respectively, for CC and MLO
views.

In  this  study,  we  investigate  the  feasibility  of  computerized  mammographic  density
estimation  on  FFDMs  and  DFMs  using  the  same  image  segmentation  system.  The  MDEST 
system  was  directly  applied  to  the  FFDM  and  the  corresponding  DFM  without  any  re-training 
except  that  the  preprocessing  filter  was  modified  for  FFDMs.  The  performance  was  evaluated 
by  analyzing  the  correlation  between  the  computer-estimated  mammographic  density  and  the 
gold  standard  obtained  by  radiologists’  interactive  thresholding. 


2.  Materials  and  Methods 

The data sets, consisting of FFDM and the corresponding DFM of 99
patients with 202 images in each set, were used. Each case contains the
craniocaudal (CC) view and the mediolateral oblique (MLO) view. The FFDM
was acquired with a GE Senographe 2000D system and the raw GE FFDM was
processed by a Laplacian pyramid multi-resolution preprocessing method
(Wei et al. 2004). The preprocessed image was downsized to a pixel size
of 800 µm x 800 µm with 4096 gray levels. The DFM was acquired with
mammography systems approved by the Mammography Quality Standards Act
(MQSA) and was digitized with a LUMISYS 85 laser film scanner with a
pixel size of 50 µm x 50 µm and 4096 gray levels. The digitized
mammogram was also downsized to an 800 µm x 800 µm image using a 16x16
box filter.
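
Downsizing from 50 µm to 800 µm pixels with a 16x16 box filter amounts to averaging each non-overlapping 16x16 block of pixels (50 µm x 16 = 800 µm). A minimal sketch of that averaging follows; the function name is hypothetical, and discarding trailing rows or columns that do not fill a complete block is an assumption, since the source does not say how image edges were handled. The example uses a 2x2 block on a 4x4 image to keep the arithmetic visible.

```python
def box_downsample(image, factor):
    """Average non-overlapping factor x factor blocks of a 2D list image.

    Rows and columns that do not fill a complete block are discarded
    (an assumption made for this sketch).
    """
    if factor < 1:
        raise ValueError("factor must be a positive integer")
    out_rows = len(image) // factor
    out_cols = len(image[0]) // factor
    result = []
    for r in range(out_rows):
        out_row = []
        for c in range(out_cols):
            total = 0
            for dr in range(factor):
                row = image[r * factor + dr]
                total += sum(row[c * factor:(c + 1) * factor])
            out_row.append(total / (factor * factor))
        result.append(out_row)
    return result


# Hypothetical 4x4 image reduced with a 2x2 box filter (illustrative only).
small_image = [
    [0, 2, 4, 4],
    [2, 4, 4, 4],
    [8, 8, 1, 3],
    [8, 8, 5, 7],
]
downsized = box_downsample(small_image, 2)
```

For the digitized mammograms described above, the same call with `factor=16` would map 50 µm pixels to 800 µm pixels.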

Our previously developed computerized system MDEST was applied to the
FFDMs and DFMs to estimate the mammographic density without any
re-training. The density estimation was performed in three stages:
breast region segmentation, image enhancement, and gray level
thresholding based on histogram analysis. First, the breast region was
segmented from the surrounding background by an automated breast
boundary tracking algorithm for DFM. For FFDM, thresholding was used to
separate the breast region from the background. Since our current
pectoral muscle trimming program is not 100% accurate, the pectoral
muscle was manually trimmed on the MLO view images for both DFM and FFDM
in this study in order to separate the errors due to breast density
segmentation from those due to pectoral muscle trimming. Second, an
adaptive dynamic range compression technique was applied to enhance the
DFM image. For the FFDM image, a Laplacian pyramid multi-resolution
preprocessing method (Wei et al. 2004) was used for image enhancement.
At the third stage, for both FFDM and DFM, rule-based classification was
used to classify the breast image into one of four classes according to
the characteristic features of its gray level histogram (Zhou et al.
2001). For each image in a given class, a gray level threshold was
determined adaptively to segment the dense area from the breast region.
The breast density was estimated as the percentage of the segmented
dense area relative to the breast area. As an example, typical
mammograms in the four classes with the corresponding enhanced images,
histograms, selected thresholds and the segmented image are shown in
Figure 1. To evaluate the performance of MDEST, the computer
segmentation results were compared to those by manual segmentation with
interactive thresholding by four MQSA radiologists. The “gold standard”
of percent dense area for each mammogram was obtained by averaging the
manually segmented percent dense areas of the four radiologists.

3.  Results 

Figures 2(a)-(d) show the comparison of the percent dense area between
the estimation by the MDEST system and the gold standard on FFDM and DFM
for CC- and MLO-view mammograms, respectively. Table 1 summarizes the
comparison between the MDEST performance and the gold standard for FFDM
and DFM, respectively. The correlation between the computer-estimated
percent dense area and the gold standard is 0.850 and 0.873 on FFDM, and
0.885 and 0.824 on DFM, for CC and MLO views, respectively. For all of
the images combining CC- and MLO-views, the correlation is 0.859 and
0.855 on FFDM and DFM, respectively. The RMS difference in the percent
dense area between the MDEST estimation and the gold standard is 7.26%,
5.70% and 6.52% on FFDM, and 6.87%, 8.16% and 7.54% on DFM for CC-view
alone, MLO-view alone, and combined CC and MLO-views, respectively.


Figure 1. Typical mammograms in the four density classes and the
corresponding enhanced and segmented images, histograms and thresholds.
The columns from left to right correspond to the original image,
enhanced image, segmented image and the histogram. Rows from top to
bottom correspond to class one to four.


[Axis label in all four panels: % Breast Density (Computer)]

Figure 2. Comparison of the percent dense area between the estimation by
the MDEST system and the gold standard. The dashed line represents the
linear regression of the data on the plot. (a) FFDM CC view, (b) FFDM
MLO view, (c) DFM CC view, (d) DFM MLO view.


Table 1. The correlation and RMS difference between the estimated
percent dense area by the MDEST system and the gold standard (average of
four MQSA radiologists).

                 FFDM                         DFM
Image subsets    Correlation    RMS Error     Correlation    RMS Error
CC view          0.850          7.26%         0.885          6.87%
MLO view         0.873          5.70%         0.824          8.16%
All images       0.859          6.52%         0.855          7.54%
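
The gold standard and the RMS difference used in Table 1 can be illustrated with the sketch below. It assumes the RMS difference is the square root of the mean squared difference between the computer estimate and the per-image average of the radiologists' values; the function names and all numbers are hypothetical and are not the study data.

```python
import math


def gold_standard(radiologist_values):
    """Average the radiologists' percent dense area for each image.

    radiologist_values: one list per radiologist, aligned by image.
    """
    n_readers = len(radiologist_values)
    return [sum(per_image) / n_readers for per_image in zip(*radiologist_values)]


def rms_difference(estimates, reference):
    """Root-mean-square difference between two aligned sequences."""
    if len(estimates) != len(reference) or not estimates:
        raise ValueError("need two non-empty sequences of equal length")
    squared = [(e - r) ** 2 for e, r in zip(estimates, reference)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical percent dense areas for three images from four readers.
readers = [
    [8, 32, 50],
    [10, 34, 52],
    [12, 36, 50],
    [10, 34, 52],
]
gold = gold_standard(readers)       # per-image mean of the four readers
computer = [12, 30, 55]             # hypothetical computer estimates
rms = rms_difference(computer, gold)
```

Unlike the correlation coefficient, the RMS difference is expressed in the same units as the percent dense area, so it reflects the size of the disagreement as well as its consistency.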

4.  Conclusion

Our  preliminary  study  demonstrated  that  the  estimation  of  mammographic  density  could  be 
performed  efficiently  by  the  automated  image  analysis  tool.  The  computer-estimated  percent 
dense  area  had  a  high  correlation  with  the  gold  standard  obtained  from  averaging  four  MQSA 
radiologists’  manual  segmentation.  The  results  also  demonstrated  the  feasibility  of  estimating 
breast  density  automatically  on  FFDM  and  DFM  using  the  same  MDEST  system.  Further 
study  will  be  conducted  to  improve  the  breast  density  segmentation  accuracy. 
ABSTRACT

Automatic  identification  of  the  pectoral  muscle  on  MLO  view  is  an  essential  step  for  computerized  analysis  of 
mammograms.  It  can  reduce  the  bias  of  mammographic  density  estimation,  will  enable  region-specific  processing  in 
lesion  detection  programs,  and  also  may  be  used  as  a  reference  in  image  registration  algorithms.  We  are  developing  a 
computerized  method  for  the  identification  of  pectoral  muscle  on  mammograms.  The  upper  portion  of  the  pectoral 
edges  was  first  detected  to  estimate  the  direction  of  the  pectoral  muscle  boundary.  A  gradient-based  directional  (GD) 
filter  was  used  to  enhance  the  linear  texture  structures,  and  then  a  gradient-based  texture  analysis  was  designed  to 
extract  a  texture  orientation  image  that  represented  the  dominant  texture  orientation  at  each  pixel.  The  texture 
orientation  image  was  enhanced  by  a  second  GD  filter.  An  edge  flow  propagation  method  was  developed  to  extract 
edges  around  the  pectoral  boundary  using  geometric  features  and  anatomic  constraints.  The  pectoral  boundary  was 
finally  generated  by  a  second-order  curve  fitting.  118  MLO  view  mammograms  were  used  in  this  study.  The  pectoral 
muscle  boundary  identified  on  each  image  by  an  experienced  radiologist  was  used  as  the  gold  standard.  The  accuracy  of 
pectoral  boundary  detection  was  evaluated  by  two  performance  metrics.  One  is  the  overlap  percentage  between  the 
computer-identified  area  and  the  gold  standard,  and  the  other  is  the  root-mean-square  (RMS)  distance  between  the 
computer  and  manually  identified  pectoral  boundary.  For  118  MLO  view  mammograms,  99.2%  (117/118)  of  the 
pectoral  muscles  could  be  identified.  The  average  of  the  overlap  percentage  is  94.8%  with  a  standard  deviation  of 
20.9%,  and  the  average  of  the  RMS  distance  is  4.3  mm  with  a  standard  deviation  of  5.9  mm.  These  results  indicate  that 
the  pectoral  muscle  on  mammograms  can  be  detected  accurately  by  our  automated  method. 

Keywords: computer-aided detection, pectoral muscle trimming, breast density estimation, directional gradient filter


1.  INTRODUCTION 

Breast cancer is one of the leading causes of cancer mortality among women. At present, the most successful method
for the early detection of breast cancer is screening mammography. It has been demonstrated that an effective
computer-aided diagnosis (CAD) system can provide a second opinion to the radiologists and improve the accuracy of
detection and characterization of mammographic abnormalities, which, in turn, may reduce unnecessary biopsies.
Studies have shown that there is a strong positive correlation between breast parenchymal density on mammograms and
breast cancer risk. The relative risk is estimated to be about 4-6 times higher for women whose mammograms have
parenchymal densities over 60% of the breast area, as compared to women with parenchymal densities of less than 5%.
Because mammograms are analyzed visually by radiologists, the qualitative assessment may vary from radiologist to
radiologist owing to the subjective nature of visual analysis. We have previously developed a computerized system, the
mammographic density estimator (MDEST), to estimate breast density automatically on digitized film mammograms. For each
mammogram,  the  breast  region  was  first  segmented  by  breast  boundary  detection  and,  for  the  mediolateral  oblique 
(MLO)  view,  with  additional  pectoral  muscle  trimming.  A  gray  level  threshold  was  then  automatically  determined  to 
segment  the  dense  tissue  from  the  breast  region.  The  breast  density  was  estimated  as  the  percentage  of  the  segmented 
dense  area  relative  to  the  breast  area.  Our  preliminary  study  indicated  that  the  computer-estimated  mammographic 
breast  density  correlated  closely  with  the  “reference  standard”  obtained  by  averaging  five  experienced  radiologists’ 
manual  segmentations  and  the  average  bias  was  much  less  than  that  of  the  radiologists’  visual  estimation. 
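
The density measure described above, the segmented dense area as a percentage of the breast area, can be sketched in a few lines. This is only an illustration: it assumes that dense tissue is brighter than fatty tissue so that pixels at or above the threshold count as dense, and the function name, the mask layout, and the threshold value are hypothetical. The automatic threshold selection itself is not shown.

```python
def percent_dense_area(image, breast_mask, threshold):
    """Percentage of breast pixels whose gray level is at or above threshold."""
    breast_pixels = 0
    dense_pixels = 0
    for row, mask_row in zip(image, breast_mask):
        for gray, inside in zip(row, mask_row):
            if inside:
                breast_pixels += 1
                # Assumption: dense tissue appears brighter than fatty tissue.
                if gray >= threshold:
                    dense_pixels += 1
    if breast_pixels == 0:
        raise ValueError("breast mask is empty")
    return 100.0 * dense_pixels / breast_pixels

# Tiny synthetic example: the first column lies outside the breast region.
image = [[10, 200, 220], [15, 30, 210], [5, 20, 25]]
mask = [[0, 1, 1], [0, 1, 1], [0, 1, 1]]
density = percent_dense_area(image, mask, threshold=128)  # 3 of 6 breast pixels
```

For an MLO view, the mask would also exclude the pectoral muscle, which is why the accuracy of pectoral trimming affects the density estimate.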

Automatic identification of the pectoral muscle is an essential step for computerized analysis of mammograms.
Accurate segmentation of the pectoral muscle on MLO-view mammograms can reduce the bias of mammographic
density estimation and improve the performance of our MDEST method. It will enable region-specific processing in
lesion detection programs to reduce false negatives. False positives can be reduced if the detected objects in the pectoral
muscle area can be selectively suppressed. The identification of the pectoral muscle may also be used as a reference in
image registration algorithms for multiple-view analysis of mammograms.

In our preliminary study, the pectoral muscle was trimmed using a gradient-based pectoral edge detection method: the
initial  edge  in  the  pectoral  region  was  first  found  as  the  maximum  gradient  point  by  a  line-by-line  gradient  analysis  from 
the  chest  wall  to  the  breast  boundary.  An  edge  validation  process  was  then  performed  to  remove  the  false  pectoral 
muscle  edges  using  a  line  fitting  method,  and  a  coarse  direction  of  the  pectoral  edges  was  estimated  from  the  validated 
edges.  The  remaining  pectoral  edges  were  extrapolated  along  the  estimated  pectoral  direction.  Finally,  a  second  order 
curve  was  fitted  to  the  detected  pectoral  edges  to  generate  the  pectoral  boundary.  Using  the  above  method,  74.6%  of  the 
pectoral  muscles  were  determined  by  visual  judgment  to  be  correctly  identified  in  this  preliminary  study. 
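
Two steps of the preliminary method, the line-by-line search for the maximum gradient and the final second-order curve fit, can be sketched as below. This is a simplified illustration and not the authors' implementation: the edge validation and extrapolation steps are omitted, the fit is an ordinary least-squares fit of column position against row index, and all names and the synthetic image are hypothetical.

```python
def max_gradient_column(row):
    """Index of the largest absolute difference between neighboring pixels."""
    diffs = [abs(row[i + 1] - row[i]) for i in range(len(row) - 1)]
    return diffs.index(max(diffs))

def fit_second_order(ys, xs):
    """Least-squares fit of x = a*y**2 + b*y + c; returns (a, b, c)."""
    # Build the 3x3 normal equations as an augmented matrix.
    s = [sum(y ** k for y in ys) for k in range(5)]  # sums of y^0 .. y^4
    t = [sum(x * y ** k for x, y in zip(xs, ys)) for k in range(3)]
    m = [
        [s[4], s[3], s[2], t[2]],
        [s[3], s[2], s[1], t[1]],
        [s[2], s[1], s[0], t[0]],
    ]
    # Gaussian elimination with partial pivoting
    # (needs at least three distinct row positions).
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, 3):
            factor = m[r][col] / m[col][col]
            for k in range(col, 4):
                m[r][k] -= factor * m[col][k]
    # Back substitution.
    coeffs = [0.0, 0.0, 0.0]
    for r in (2, 1, 0):
        acc = m[r][3] - sum(m[r][k] * coeffs[k] for k in range(r + 1, 3))
        coeffs[r] = acc / m[r][r]
    return tuple(coeffs)

# Synthetic image: the bright-to-dark transition moves one column right per row.
image = [[200] * (y + 2) + [40] * (6 - y) for y in range(5)]
rows = list(range(len(image)))
edge_cols = [max_gradient_column(row) for row in image]
a, b, c = fit_second_order(rows, edge_cols)
```

On this synthetic image the detected edge is a straight line, so the fitted quadratic term is close to zero; on a real mammogram the second-order term allows for the slight curvature of the pectoral boundary.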

The  purpose  of  this  study  is  to  improve  the  performance  of  our  previously  developed  pectoral  muscle  segmentation 
method.  Accurate  identification  of  the  pectoral  muscle  on  mammograms  is  challenging,  especially  for  the  improperly 
positioned MLO-view images and the images containing dense glandular tissues overlapping with the pectoral muscle
region.  In  this  work,  we  developed  a  two-stage  gradient-based  texture  analysis  method  to  detect  the  pectoral  boundary. 
In  the  first  stage,  linear  texture  structures  were  enhanced  and  the  directional  gradients  were  computed  using  a 
directional  filter.  In  the  second  stage,  a  texture  orientation  image  was  derived  as  the  dominant  texture  orientation  at  each 
pixel.  A  diffusion  filter  was  used  to  estimate  the  global  direction  of  the  pectoral  boundary.  An  edge  flow  propagation 
method  was  developed  to  extract  the  pectoral  edges  with  the  guidance  of  the  estimated  global  direction. 

2.  MATERIALS  AND  METHODS 


2.1  Materials 

In this study, 118 MLO-view mammograms from 103 patients were randomly selected from the patient files in the
Radiology Department at the University of Michigan. Data collection was approved by the Institutional Review Board
and individual patient informed consent was waived. The mammograms were acquired with Mammography Quality
Standards Act (MQSA) approved GE DMR (Milwaukee, Wisconsin) mammography units using Kodak MR2000
screen/film systems. All films were digitized with a LUMISYS 85 laser film scanner with a pixel size of 50 µm × 50 µm
and 4096 gray levels. The resolution of the mammograms was reduced to 800 µm × 800 µm for segmentation of the
pectoral muscle.
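
Going from a 50 micrometer pixel to an 800 micrometer pixel is a reduction by a factor of 16 in each direction. The text does not say how the reduction was done; the sketch below assumes simple averaging of non-overlapping 16 x 16 blocks, which is one common choice, and the function name and test image are hypothetical.

```python
def block_average(image, factor):
    """Downsample a 2D list by averaging non-overlapping factor x factor blocks."""
    rows = len(image) // factor
    cols = len(image[0]) // factor
    out = []
    for r in range(rows):
        out_row = []
        for c in range(cols):
            total = 0
            for dr in range(factor):
                for dc in range(factor):
                    total += image[r * factor + dr][c * factor + dc]
            out_row.append(total / (factor * factor))
        out.append(out_row)
    return out

FACTOR = 800 // 50  # ratio of the reduced to the original pixel size (micrometers)
image = [[(r + c) % 4096 for c in range(32)] for r in range(32)]  # 12-bit gray levels
small = block_average(image, FACTOR)  # 32 x 32 pixels become 2 x 2 pixels
```

Rows and columns that do not fill a complete block are dropped in this sketch; a production implementation would need a policy for the image border.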

2.2  Pectoral  muscle  identification 

Figure  1  summarizes  the  automatic  pectoral  muscle  identification  scheme.  The  interference  due  to  overlapping  of  the 
glandular  tissue  on  the  pectoral  muscle  region  is  first  reduced  by  smoothing  the  mammogram  using  an  edge  preserving 
anisotropic diffusion filter. Because less glandular tissue appears at the upper region of the pectoral muscle, the upper
portion  of  the  pectoral  boundary  usually  remains  sharp  after  smoothing  and  can  be  detected  robustly  by  searching  the 
maximum  horizontal  gradients  on  the  diffused  image.  The  extrapolation  of  the  detected  upper  pectoral  boundary 
provides  a  coarse  global  direction  of  the  pectoral  boundary.  To  refine  the  entire  pectoral  boundary,  a  gradient-based 
directional  (GD)  filter  was  first  employed  to  enhance  the  linear  texture  structures  on  the  mammogram.  The  orientation 
of  the  digitized  image  could  be  automatically  determined  by  the  curvature  of  the  breast  boundary.  For  example,  if  the 
image  was  positioned  such  that  the  chest  wall  was  on  the  right  side,  it  could  be  assumed  that  the  pectoral  boundary  is  at 
a  direction  approximately  from  the  top-left  to  the  bottom-right  with  less  than  45  degree  deviation.  Therefore,  in  our 
study,  the  kernel  of  the  GD  filter  was  designed  as  a  step  function  with  45  degree  orientation.  After  the  pectoral  edge  was 
enhanced by the GD filter, a gradient-based texture analysis was used to compute an orientation image which
represented the dominant texture orientation at each pixel. The orientation image was smoothed using an edge
preserving mean shift algorithm that iteratively shifted each pixel to the average of the pixels in its neighborhood. The
texture  patterns  with  dominant  texture  orientations  directing  from  the  top-left  to  the  bottom-right,  which  were  more 
likely  to  be  the  pectoral  edges,  were  enhanced  by  applying  a  second  GD  filter  to  the  smoothed  orientation  image. 
Candidate edges of the pectoral muscle were detected on the enhanced orientation image using a ridge-tracking
algorithm. The ridges were tracked by searching for the local maximum along the coarse global direction estimated, as
described  above,  by  the  upper  pectoral  boundary  on  the  anisotropic  diffused  image.  With  the  guidance  of  the  estimated 
global  direction  of  the  pectoral  boundary  and  the  anatomical  constraints,  an  edge  flow  propagation  algorithm  was  then 
used  to  extract  the  boundary  points  of  the  pectoral  muscle  by  pruning  the  edges  that  are  less  likely  to  lie  on  the  pectoral 
boundary.  A  second  order  curve  fitting  was  finally  used  to  generate  the  pectoral  muscle  boundary.  Figure  2  shows 
examples  of  the  intermediate  images  of  pectoral  boundary  enhancement  and  edge  tracking  corresponding  to  the  various 
stages shown in the flowchart in Figure 1.
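
The first step of the scheme, edge-preserving smoothing, can be illustrated with a classic Perona-Malik diffusion. The text names only an "edge preserving anisotropic diffusion filter", so the specific scheme, the exponential conductance function, and the parameter values (`iterations`, `kappa`, `step`) in this sketch are assumptions chosen for illustration.

```python
import math

def anisotropic_diffusion(image, iterations=10, kappa=20.0, step=0.2):
    """Perona-Malik diffusion on a 2D list of gray levels; returns a new image."""
    rows, cols = len(image), len(image[0])
    current = [[float(v) for v in row] for row in image]
    for _ in range(iterations):
        updated = [row[:] for row in current]
        for r in range(rows):
            for c in range(cols):
                flux = 0.0
                for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < rows and 0 <= cc < cols:
                        diff = current[rr][cc] - current[r][c]
                        # Conductance drops toward zero across strong edges,
                        # so smoothing happens within regions, not across them.
                        flux += math.exp(-(diff / kappa) ** 2) * diff
                updated[r][c] = current[r][c] + step * flux
        current = updated
    return current

# Synthetic rows: small fluctuations on each side of one strong step edge.
image = [[10, 14, 10, 200, 204, 200] for _ in range(4)]
smoothed = anisotropic_diffusion(image)
```

In this toy example the small fluctuations on either side of the step are averaged out while the step itself survives, which is the behavior the scheme relies on to keep the upper pectoral boundary sharp. The step size is kept at or below 0.25 so that the four-neighbor update stays stable.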


3.  RESULTS 

An  experienced  MQSA-radiologist  used  a  graphical  user  interface  to  manually  draw  the  pectoral  muscle  boundary  on 
each  MLO-view  mammogram,  which  was  then  used  as  the  gold  standard  for  the  evaluation  of  the  performance  of  our 
pectoral  muscle  detection  program. 

For  each  MLO  view  mammogram,  the  accuracy  of  pectoral  boundary  detection  was  evaluated  by  two  performance 
metrics:  the  percentage  of  overlap,  defined  as  the  ratio  of  the  overlap  area  between  the  computer  detected  pectoral 
muscle  area  and  the  gold  standard  relative  to  the  gold  standard,  and  the  root-mean-square  (RMS)  distance  obtained  by 
calculating  the  shortest  distance  point  by  point  between  the  computer-identified  pectoral  boundary  and  the  manually 
marked pectoral boundary. For the data set of 118 MLO view mammograms, 99.2% (117/118) of the pectoral muscles
could be identified. The average percent overlap area is 94.8% with a standard deviation of 20.9%, and the average
RMS distance is 4.3 mm with a standard deviation of 5.9 mm.
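
The two performance metrics defined above can be sketched as follows. This is an illustration under stated assumptions: the masks are binary 2D lists on the same grid, the RMS distance is measured from each computer boundary point to its nearest gold-standard point, and the default `pixel_mm` of 0.8 reflects the 800 µm pixel size used for segmentation. The function names and the sample data are hypothetical.

```python
import math

def overlap_percentage(computer_mask, gold_mask):
    """Overlap area between the two masks as a percentage of the gold-standard area."""
    overlap = 0
    gold_area = 0
    for comp_row, gold_row in zip(computer_mask, gold_mask):
        for comp, gold in zip(comp_row, gold_row):
            if gold:
                gold_area += 1
                if comp:
                    overlap += 1
    return 100.0 * overlap / gold_area

def rms_boundary_distance(computer_points, gold_points, pixel_mm=0.8):
    """RMS of the shortest distance from each computer point to the gold boundary, in mm."""
    squared = []
    for r1, c1 in computer_points:
        nearest = min((r1 - r2) ** 2 + (c1 - c2) ** 2 for r2, c2 in gold_points)
        squared.append(nearest)
    return pixel_mm * math.sqrt(sum(squared) / len(squared))

gold_mask = [[1, 1], [1, 1]]
computer_mask = [[1, 1], [1, 0]]
overlap = overlap_percentage(computer_mask, gold_mask)  # 3 of 4 gold pixels covered
distance_mm = rms_boundary_distance([(0, 0), (1, 0)], [(0, 3), (1, 3)])
```

Because the overlap is normalized by the gold-standard area only, a computer region that is too large is not penalized by the first metric; the boundary distance complements it by measuring how far the two curves are apart.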

Figure 3 shows some examples of pectoral boundary identification on mammograms. The computer-identified pectoral
boundaries are shown as white lines and the radiologist’s hand-drawn boundaries as dark lines. Figure 3(a)-(b)
show that the pectoral boundary can be identified accurately on mammograms with weak pectoral edges (Figure 3(a)) and
with a large area of dense tissue overlapping the pectoral muscle area (Figure 3(b)). Figure 3(c)-(d) show two
examples of less accurate pectoral boundaries detected by the computer. Figure 3(e) shows the only case in this data set
in which the computer failed to detect the boundary.


4.  CONCLUSION 

The  newly  developed  gradient-based  directional  filter  and  the  dominant  texture  orientation  estimation  method  can 
enhance  the  pectoral  boundary  regions.  The  edge  flow  propagation  method  can  accurately  extract  pectoral  edges  to 
generate  the  pectoral  boundary.  Automatic  pectoral  muscle  identification  will  provide  the  foundation  for  many 
mammographic  image  analysis  tasks  in  CAD  applications. 
Figure 1. Automated pectoral muscle detection scheme


Proc.  of  SPIE  Vol.  5747  855 


Figure 2. Example of boundary enhancement and segmentation of pectoral muscle. (a) original image; (b) texture
orientation image after first GD filter and texture-flow analysis; (c) ridge image enhanced by the second
GD filter; (d) tracked ridges; (e) smoothed image using anisotropic diffusion filter; (f) initial pectoral
edges detected from the smoothed image in (e) for the estimation of the coarse direction of the pectoral
boundary; (g) propagated pectoral edges on the ridge image (c) with the guidance of the coarse
direction estimated from the smoothed image shown in (f); (h) the final identified pectoral boundary
after second-order curve fitting.
Figure 3. Examples of pectoral boundary segmentation on mammograms. (a)-(b): accurate identification of the
pectoral boundary; (c)-(d): less accurate identification of the pectoral boundary; (e): the only
mammogram in our data set for which the computer failed to identify the pectoral muscle, due to the
small portion of the pectoral muscle area within the breast region.
Breast Density Estimation: Correlation of Mammographic Density and MR Volumetric Density

Chan HP, Hadjiiski LM, Roubidoux MA, Helvie MA, Paquerault S, Sahiner B, Chenevert T, Goodsitt MM

Studies  have  demonstrated  a  strong  correlation  between  mammographic  breast 
density  and  breast  cancer  risk.  Mammographic  breast  density  may  therefore  be  used 
as  a  surrogate  marker  for  monitoring  the  response  to  treatment  in  studies  of  breast 
cancer  prevention  or  intervention  methods.  In  this  study,  we  evaluated  the  accuracy 
of  using  mammograms  for  estimating  breast  density  by  analyzing  the  correlation 
between  the  mammographic  areal  density  and  the  glandular  tissue  volume  as 
estimated  from  MRI. 

A  data  set  of  fifty  patients  who  had  corresponding  MR  images  and  mammograms  was 
collected.  The  coronal  MR  images  provided  full  volume  coverage  of  both  breasts  with 
contiguous image sections. A heavily T1-weighted acquisition was used, which
produced images with well-separated dense parenchyma and adipose tissue. The
glandular tissue regions in the MR sections were outlined interactively by
experienced radiologists with a graphical user interface, and the percent glandular
tissue volume was calculated. Mammographic density was estimated with an automated
image  analysis  program  and  compared  to  that  provided  by  manual  segmentation.  The 
computer  algorithm  included  dynamic  range  compression,  breast  boundary  tracking, 
and  automatic  thresholding  based  on  analysis  of  the  gray  level  histogram.  The  dense 
tissue  regions  on  the  mammogram  were  then  segmented  and  the  percent  dense  area 
estimated. 

The  correlation  between  the  computer-estimated  percent  dense  area  and  radiologists’ 
manual  segmentation  of  mammographic  density  was  found  to  range  from  0.91  to  0.94 
in  an  independent  set  of  mammograms.  The  analysis  of  the  relationship  between 
mammographic  percent  dense  area  and  MR  percent  glandular  tissue  volume  is 
underway.  We  will  discuss  the  correlation  of  the  breast  fibroglandular-to-adipose 
ratios  estimated  from  these  two  approaches. 
Chuan Zhou, Mitchell Goodsitt, Berkman Sahiner, Marilyn Roubidoux
Mammographic breast density, an indicator of the proportion of fibroglandular vs. fatty
tissue in the breast, has been found to have strong correlation with breast cancer risk.
Mammographic breast density has therefore been used for monitoring the response in
studies of preventive or interventional treatment of breast cancer. Breast density changes
during the course of treatment are often estimated visually on mammograms by
radiologists, which involves large inter- and intraobserver variations. The goal of this
project  is  to  develop  an  automated  image  analysis  method  that  can  provide  a  more 
consistent  and  reproducible  estimate  of  the  percent  dense  breast  area  on  a  mammogram. 

An  automated  computer  program  has  been  developed  that  performs  breast  density  analysis 
using  the  following  steps:  detection  of  the  breast  boundary,  reduction  of  the  image  dynamic 
range,  analysis  and  classification  of  the  shape  of  the  gray  level  histogram,  adaptive  gray 
level  thresholding,  and  estimation  of  the  percent  dense  tissue  area  relative  to  the  breast  area. 
The  performance  of  the  algorithm  was  evaluated  by  comparing  the  computer  segmentation 
results  to  manual  segmentation  with  interactive  thresholding  by  five  radiologists. 

To  further  investigate  the  relationship  between  the  mammographic  breast  density  and  the 
amount  of  fibroglandular  tissue  in  the  breast,  the  image  analysis  program  was  applied  to  the 
mammograms  of  37  patients  who  had  corresponding  magnetic  resonance  (MR)  images  of 
the breasts. The fibroglandular tissue regions in the MR slices were segmented interactively
with a user interface, and the percentage of fibroglandular tissue volume in the breast was
estimated. The correlation between the percent dense area estimated from mammograms
and  the  percent  volumetric  fibroglandular  tissue  estimated  from  MR  images  was  studied. 

We  found  that  the  correlation  between  the  computer-estimated  percent  dense  area  and  the 
average  of  the  five  radiologists’  manual  segmentation  was  0.94  and  0.91,  respectively,  for 
CC  and  MLO  views,  with  a  mean  bias  of  less  than  2%.  The  percent  breast  dense  area  of  the 
CC  and  MLO  views  has  a  correlation  of  0.92  and  0.91,  respectively,  with  the  percent 
volumetric  fibroglandular  tissue  on  MR  images.  Mammographic  density  is  therefore  highly 
correlated  with  the  volumetric  fibroglandular  tissue  in  the  breast,  indicating  its  usefulness  as 
a  surrogate  for  breast  density  estimation. 


The  computerized  image  analysis  tool  is  useful  for  breast  density  estimation  on 
mammograms.  The  automated  analysis  is  expected  to  contribute  to  the  understanding  of  the 
relationship  of  mammographic  density  to  breast  cancer  risk,  detection,  and  prognosis,  and  to 
the  prevention  and  treatment  of  breast  cancer. 


The  U.S.  Army  Medical  Research  Materiel  Command  under  DAMD17-01-1-0326 
supported  this  work. 


A3 

Breast Density Estimation on Mammograms and MR Images: A Tool for Assessment of Breast Cancer Risk


Jun  Wei,  Heang-Ping  Chan,  Mark  A.  Helvie,  Lubomir  M.  Hadjiiski, 

Berkman  Sahiner,  Marilyn  A.  Roubidoux,  Chuan  Zhou,  Sophie  Paquerault, 

Thomas  Chenevert,  Mitchell  M.  Goodsitt 

PURPOSE:  Previous  studies  have  found  that  mammographic  breast  density  is  highly 
correlated with breast cancer risk. We have developed a computerized image analysis tool,
Mammography Density ESTimator (MDEST), to estimate the percent dense area on
mammograms.  In  this  study,  we  analyzed  the  correlation  between  mammographic  percent 
dense  area  and  percent  volumetric  fibroglandular  tissue  on  MR  images. 

METHOD AND MATERIALS: For the estimation of mammographic breast density,
MDEST  performs  the  following  procedures:  detection  of  the  breast  boundary,  reduction  of 
the  image  dynamic  range,  analysis  and  classification  of  the  gray  level  histogram,  adaptive 
gray  level  thresholding,  and  estimation  of  the  percent  dense  tissue  area  relative  to  the 
breast  area.  The  performance  of  MDEST  was  validated  by  comparing  its  segmentation  to 
that  with  manual  interactive  thresholding  by  five  radiologists  in  260  mammograms.  For 
the estimation of the percent volume of fibroglandular tissue in breast MR images, a semi-automatic
method has been developed to segment the fibroglandular tissue from each slice.
First,  the  breast  boundary  is  detected  automatically.  A  deformable  model  and  manual 
modification  are  used  to  correct  for  incorrectly  detected  boundaries  that  usually  occur  in 
slices  near  the  chest  wall  where  the  breast  boundary  is  not  well-defined.  Because  of  the 
nonuniformity  of  the  breast  coil,  the  signal  intensity  in  the  breast  region  is  not  uniform 
across  the  field  of  view.  A  background  correction  technique  that  estimates  the  low 
frequency  background  from  the  gray  levels  along  the  breast  boundary  is  developed  to 
reduce  the  nonuniformity.  Finally,  manual  interactive  thresholding  of  the  gray  level 
histogram  in  the  breast  region  is  used  to  separate  the  fibroglandular  from  the  fatty  region. 
The  tissue  volume  is  calculated  by  integration  over  all  slices  containing  the  breast.  A  data 
set  of  54  cases  having  MR  images  and  corresponding  4-view  mammograms  was  used  in 
this study. The MR images were coronal 3D SPGR T1-weighted pre-contrast images.
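
The volume calculation by integration over slices can be sketched as a sum of per-slice areas multiplied by the slice thickness. This is a minimal illustration that assumes contiguous slices of equal thickness and per-slice areas already obtained from the segmentation; the function name, the area values, and the 3 mm thickness are hypothetical.

```python
def percent_fibroglandular_volume(slice_areas, slice_thickness_mm):
    """slice_areas: list of (fibroglandular_area_mm2, breast_area_mm2), one pair per slice."""
    dense_volume = sum(dense * slice_thickness_mm for dense, _ in slice_areas)
    breast_volume = sum(breast * slice_thickness_mm for _, breast in slice_areas)
    if breast_volume == 0:
        raise ValueError("no breast tissue in the given slices")
    return 100.0 * dense_volume / breast_volume

# Made-up areas for three contiguous slices, for illustration only.
areas = [(100.0, 1000.0), (300.0, 1000.0), (200.0, 2000.0)]
percent_volume = percent_fibroglandular_volume(areas, slice_thickness_mm=3.0)
```

With equal slice thickness the thickness cancels in the ratio, so the percent volume depends only on the summed areas; the thickness matters when absolute volumes are reported or when slices are not uniform.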

RESULTS: The percent volume of fibroglandular tissue had a correlation of 0.92 and
0.91 with the percent dense area obtained on CC-view and MLO-view mammograms,
respectively.  The  percent  mammographic  dense  area  slightly  overestimates  the  percent 
volume  with  a  mean  bias  of  3%. 

CONCLUSIONS: Mammographic density is highly correlated with the volumetric
fibroglandular  tissue  in  the  breast,  indicating  its  usefulness  as  a  surrogate  for  breast 
density  estimation  and  thus  for  monitoring  breast  cancer  risk. 
Comparison of mammographic density estimated on digital mammograms and screen-film mammograms

Heang-Ping  Chan,  Jun  Wei,  Chuan  Zhou,  Mark  A.  Helvie,  Marilyn  Roubidoux,  Janet  Bailey, 

Lubomir  Hadjiiski,  Berkman  Sahiner 

PURPOSE: To compare breast density estimated on pairs of digital mammogram (DM) and screen-film
mammogram (SFM) obtained from the same patients.

METHODS AND MATERIALS: We are comparing image information on DMs and SFMs for
radiologist’s interpretation and computerized image analysis. One hundred forty-five pairs of DM
and SFM (76 CC views and 69 MLO views) were collected with IRB approval from 68 patients.
The time interval between the DM and SFM ranged from 0 to 118 days (median=21 days). The
SFMs were acquired with GE DMR systems and the DMs were acquired with a GE Senographe
2000D system. Both the DMs and the SFMs were acquired with automated exposure techniques
that selected the appropriate target, filter, and kVp. The SFMs were digitized with a laser film
scanner. The breast boundaries on the DMs and SFMs were detected automatically by the
computer.  The  mammograms  were  displayed  on  a  workstation  with  a  graphical  user  interface  that 
allowed  interactive  thresholding  of  the  gray  level  histograms  to  segment  the  dense  region  from  the 
fatty  region.  The  DMs  and  SFMs  were  segmented  independently  in  separate  sessions  so  that  the 
observer  could  not  compare  the  density  of  the  corresponding  DM  and  SFM.  Hard  copies  of  the 
displayed  images  were  available  for  reference  during  segmentation.  The  mammographic  density 
was  estimated  as  the  percent  dense  area  relative  to  the  breast  area,  excluding  the  pectoral  muscle  in 
the  MLO  views. 

RESULTS:  The  correlation  between  the  mammographic  density  on  SFM  and  DM  was  0.94  and 
0.92,  the  root-mean-square  residual  was  4.5%  and  4.6%,  and  the  average  ratio  of  mammographic 
density  estimated  on  SFM  to  that  on  DM  of  the  same  breast  was  1.18  and  1.22,  respectively,  for  CC 
and  MLO  views.  The  differences  in  the  percent  dense  area  between  the  DM  and  SFM  were 
statistically significant (paired t test: p<0.0000001) for both views. The DMs used harder beams
(Mo/Mo  4.5%,  Mo/Rh  22.4%,  Rh/Rh  73.1%)  while  the  SFMs  used  softer  beams  (Mo/Mo  44.2%, 
Mo/Rh  48.1%,  Rh/Rh  7.8%).  The  peak  potential  used  for  DM  was  1  to  5  kVp  higher  than  that  for 
SFM  in  84%  of  cases. 
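
The paired comparison reported above can be sketched as follows. The code computes the paired t statistic and the average per-breast SFM-to-DM ratio; it assumes the ratio is averaged over breasts rather than taken as a ratio of means, and it stops at the t statistic because converting it to a p value requires the t distribution, which is not in the Python standard library. The function names and sample values are hypothetical.

```python
import math

def paired_t_statistic(a, b):
    """t statistic and degrees of freedom for a paired t test on the differences a - b."""
    diffs = [x - y for x, y in zip(a, b)]
    n = len(diffs)
    mean = sum(diffs) / n
    variance = sum((d - mean) ** 2 for d in diffs) / (n - 1)  # sample variance
    return mean / math.sqrt(variance / n), n - 1

def mean_ratio(a, b):
    """Average of the per-pair ratios a[i] / b[i]."""
    return sum(x / y for x, y in zip(a, b)) / len(a)

# Made-up percent dense area values for four breasts, for illustration only.
sfm = [12.0, 15.0, 20.0, 31.0]
dm = [10.0, 12.0, 18.0, 27.0]
t_value, dof = paired_t_statistic(sfm, dm)
ratio = mean_ratio(sfm, dm)
```

A ratio above 1 together with a large positive t statistic corresponds to the pattern described in the results, where the density measured on SFM is systematically higher than on DM.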

CONCLUSION:  Breast  density  on  DMs  generally  appears  to  be  lower  than  that  on  SFMs  because 
of  the  harder  beam  quality  used  and  image  processing  applied  to  the  DMs.  The  lower  density  may 
improve  the  mammographic  sensitivity  for  lesion  detection  on  dense  breasts.  However,  for  patients 
with  SFMs  and  DMs  taken  over  time,  comparison  of  serial  mammograms  for  breast  density 
changes  will  be  problematic. 
network classification of breast density
PURPOSE:

Our  previous  study  showed  the  feasibility  of  a  rule-based  automatic  breast  density  estimation  method.  However,  the 
rule-based  technique  could  not  classify  the  very  fatty  and  very  dense  breasts  consistently  with  high  accuracy  because 
some  of  these  breasts  have  very  similar  gray  level  histograms.  This  study  develops  a  new  neural  network  classifier  to 
improve  the  performance  of  rule-based  breast  density  estimation. 

METHOD & MATERIALS:

A  mammogram  is  digitized  and  the  pixel  size  is  reduced  to  0.8  mm.  The  breast  region  is  first  segmented  by  an 
automatic  boundary  tracking  and  a  pectoral  muscle  trimming  algorithm.  An  adaptive  dynamic  range  reduction 
technique  is  used  to  reduce  the  range  of  the  gray  levels  in  the  low  frequency  background  and  to  enhance  the  separation 
of  the  gray  levels  of  the  dense  and  fatty  regions.  The  breast  images  are  first  classified  by  a  rule-based  method  into  a 
class  of  median  dense  and  a  class  of  combined  very  dense/fatty  breasts  based  on  the  characteristics  of  their  gray  level 
histograms.  An  Expectation-Maximization  (EM)  algorithm  is  then  applied  to  the  latter  class  to  extract  the  gray  level 
features.  One  morphological  feature  and  12  EM  extracted  gray  level  features  are  input  to  a  feedforward  neural  network 
to  further  classify  the  mammograms  in  the  combined  class  into  a  class  of  very  dense  breasts  and  a  class  of  very  fatty 
breasts.  For  each  class,  a  gray  level  threshold  is  automatically  estimated  to  segment  the  dense  tissue.  For  comparison, 
an  experienced  radiologist  provided  a  manually  segmented  percent  dense  area  by  interactive  thresholding. 

RESULT: 

In  this  preliminary  study,  498  mammograms  from  141  patients  were  used  and  243  were  classified  into  the  very 
dense/fatty  combined  class  by  the  rule-based  classifier.  With  a  jackknife  method,  this  class  was  randomly  partitioned 
into  four  non-overlapping  groups.  In  each  jackknife  cycle,  three  groups  were  used  for  training  and  one  group  for 
testing. The overall accuracy for classification of the four test groups into very dense and very fatty breasts reached
99.6% with the neural network, compared with 84.8% for our previous rule-based classifier.
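
The partitioning scheme described above, four non-overlapping groups with three used for training and one for testing in each cycle, can be sketched as below. The neural network training itself is not shown, and the function names, the fixed random seed, and the use of integer case identifiers are assumptions made for illustration.

```python
import random

def partition(items, n_groups, seed=0):
    """Randomly split items into n_groups non-overlapping groups of near-equal size."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    return [shuffled[i::n_groups] for i in range(n_groups)]

def jackknife_cycles(groups):
    """Yield (training, testing) pairs, holding out one group per cycle."""
    for held_out in range(len(groups)):
        training = [x for i, g in enumerate(groups) if i != held_out for x in g]
        yield training, groups[held_out]

case_ids = range(243)  # one identifier per mammogram in the combined class
groups = partition(case_ids, 4)
cycles = list(jackknife_cycles(groups))
```

Every mammogram appears in exactly one test group, so pooling the four test results gives one classification outcome per case, which is how an overall accuracy over all 243 images can be computed.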

CONCLUSION:

The  results  demonstrate  the  feasibility  of  training  a  neural  network  classifier  for  the  classification  of  very  dense  and 
fatty  breasts.  The  neural  network  can  be  trained  very  well  using  the  morphological  feature  and  the  features  extracted  by 
the  EM  algorithm.  Combining  the  rule-based  method  with  the  NN  classifier,  the  two-stage  classification  improved  the 
performance  of  our  previous  breast  density  estimation  technique. 

Studies have demonstrated a strong correlation between mammographic breast density
and breast cancer risk. We have previously developed a computerized system, the
mammographic density estimator (MDEST), to estimate breast density automatically on
digitized film mammograms (DFMs). In this study, we evaluated the performance of the
MDEST system on full field digital mammograms (FFDMs) and DFMs. The input to the
system is a preprocessed, dynamic range compressed image. The breast region is first
segmented by breast boundary detection. The pectoral muscle is trimmed if the image is an MLO
view. A rule-based classifier is then used to classify the breast image into one of four
classes according to the characteristics of its gray level histogram. A gray level threshold
is determined to segment the dense area from the breast region. The breast density is
estimated as the percentage of the segmented dense area relative to the breast area. In this
study, two-view FFDM and the corresponding DFM from 99 patients with 202 images in
each set were used. The dense area on each mammogram was segmented by 4
radiologists using interactive thresholding and their average was used as the “ground
truth.” The MDEST system was directly applied to the FFDM and DFM data without
any retraining except that the preprocessing filter was modified for FFDMs. We found
that the correlation between the estimated percent dense area and the truth was 0.846
(0.813, 0.862) and 0.855 (0.808, 0.861) on FFDM, and 0.880 (0.867, 0.858) and 0.765
(0.766, 0.739) on DFM, for CC and MLO views, respectively. The results demonstrated
the feasibility of estimating breast density automatically on FFDM and DFM using the
same MDEST system.
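
The final step of the pipeline above, estimating breast density as the percentage of the segmented dense area relative to the breast area, can be illustrated with a short sketch. It assumes that the breast mask and the gray level threshold are already available and that brighter pixels correspond to denser tissue; the function name `percent_dense_area` is hypothetical and is not part of MDEST.

```python
def percent_dense_area(image, breast_mask, threshold):
    """Percent of breast pixels whose gray level is at or above threshold.

    image and breast_mask are equally sized lists of rows. Treating
    gray >= threshold as dense is an assumption about image polarity.
    """
    breast_pixels = 0
    dense_pixels = 0
    for img_row, mask_row in zip(image, breast_mask):
        for gray, inside in zip(img_row, mask_row):
            if inside:
                breast_pixels += 1
                if gray >= threshold:
                    dense_pixels += 1
    if breast_pixels == 0:
        raise ValueError("breast mask is empty")
    return 100.0 * dense_pixels / breast_pixels
```

Pixels outside the breast mask (background, or the trimmed pectoral muscle on an MLO view) contribute to neither the numerator nor the denominator.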

SUMMARY: Automatic identification of the pectoral muscle on MLO views is an essential step in the
computerized analysis of mammograms. It can reduce the bias of mammographic density estimation,
enable region-specific processing in lesion detection programs, and serve as a
reference in image registration algorithms. We are developing a computerized method for
identifying the pectoral muscle on mammograms.

The upper portion of the pectoral edges was first detected to estimate the direction of the pectoral
muscle boundary. A gradient-based directional bandpass (GDB) filter was used to enhance the linear
texture structures, and a gradient-based texture analysis was then used to extract a texture orientation
image that represented the dominant texture orientation at each pixel. The texture orientation image
was enhanced by a second GDB filter. An edge flow propagation method was developed to extract
edges around the pectoral boundary using geometric features and anatomic constraints. The pectoral
boundary was finally generated by second-order curve fitting. A total of 118 MLO-view mammograms were
tested in this study. The pectoral muscle boundary identified on each image by an experienced
radiologist was used as the gold standard.
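
The second-order curve fitting step can be sketched as an ordinary least-squares fit of a quadratic to the extracted boundary points. Modeling the column position as a function of the row position, and the name `fit_second_order`, are assumptions made for illustration; the parameterization used in the study is not stated.

```python
def fit_second_order(rows, cols):
    """Least-squares fit of col = a*row**2 + b*row + c; returns (a, b, c)."""
    if len(rows) < 3:
        raise ValueError("at least three boundary points are required")
    # Power sums for the normal equations of the basis [row^2, row, 1].
    s = [sum(r ** k for r in rows) for k in range(5)]
    t = [sum(c * r ** k for r, c in zip(rows, cols)) for k in range(3)]
    m = [
        [s[4], s[3], s[2], t[2]],
        [s[3], s[2], s[1], t[1]],
        [s[2], s[1], s[0], t[0]],
    ]
    # Gauss-Jordan elimination with partial pivoting on the 3x3 system.
    for i in range(3):
        pivot = max(range(i, 3), key=lambda k: abs(m[k][i]))
        m[i], m[pivot] = m[pivot], m[i]
        if abs(m[i][i]) < 1e-12:
            raise ValueError("boundary points are degenerate")
        for k in range(3):
            if k != i:
                f = m[k][i] / m[i][i]
                m[k] = [mk - f * mi for mk, mi in zip(m[k], m[i])]
    return tuple(m[i][3] / m[i][i] for i in range(3))
```

Fitting a smooth low-order curve through the pruned edge points fills the gaps between edge fragments and suppresses small local deviations.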

The accuracy of pectoral boundary detection was evaluated with two performance metrics: the
overlap percentage between the computer-identified area and the gold standard, and the
root-mean-square (RMS) distance between the computer-identified and manually identified pectoral boundaries.
For the 118 MLO-view mammograms, 99.15% (117/118) of the pectoral muscles could be identified. The
average overlap percentage was 85.9% with a standard deviation of 14.0%, and the average
RMS distance was 4.31 mm with a standard deviation of 5.94 mm. These results indicate that the
pectoral muscle on mammograms can be detected accurately by our automated method.
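
The two performance metrics can be sketched as follows. The exact definitions are not given in the text, so this sketch makes two assumptions: the overlap percentage is the intersection area divided by the gold-standard area, and the RMS distance is taken over the nearest-point distances from each computer boundary point to the manual boundary. The function names are hypothetical, and the default pixel size of 0.8 mm is the reduced resolution reported for this study.

```python
import math


def overlap_percentage(computer_mask, gold_mask):
    """Percent of the gold-standard region covered by the computer region.

    Using the gold-standard area as the denominator is an assumption.
    """
    inter = 0
    gold = 0
    for c_row, g_row in zip(computer_mask, gold_mask):
        for c, g in zip(c_row, g_row):
            if g:
                gold += 1
                if c:
                    inter += 1
    if gold == 0:
        raise ValueError("gold-standard mask is empty")
    return 100.0 * inter / gold


def rms_boundary_distance(computer_pts, manual_pts, mm_per_pixel=0.8):
    """RMS nearest-point distance (mm) from computer to manual boundary."""
    total_sq = 0.0
    for x, y in computer_pts:
        # Squared distance to the closest manually marked point.
        total_sq += min((x - u) ** 2 + (y - v) ** 2 for u, v in manual_pts)
    return mm_per_pixel * math.sqrt(total_sq / len(computer_pts))
```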

PURPOSE: The pectoral muscle is imaged in most mediolateral oblique (MLO) view
mammograms. Automatic identification of the pectoral muscle is an essential step for computerized
analysis of mammograms. It can reduce the bias of mammographic density estimation, enable
region-specific processing in lesion detection programs, and serve as a reference in image
registration algorithms. The goal of this study is to develop an automated method to identify the
pectoral muscle on mammograms.

METHOD  &  MATERIALS:  Because  of  noise  and  the  overlapping  structures,  the  pectoral  boundary 
is  usually  detected  as  pieces  of  edges  mixed  with  a  large  number  of  false  edges.  The  pieces  of  edges 
have  to  be  pruned  and  linked  to  obtain  a  continuous  boundary.  We  are  developing  a  pectoral  boundary 
detection  algorithm  based  on  gradient-based  texture  orientation  analysis.  The  interference  due  to 
overlapping  of  the  glandular  tissue  on  the  pectoral  muscle  region  is  first  reduced  by  blurring  the 
mammogram  using  an  edge  preserving  anisotropic  diffusion  filter.  Because  less  glandular  tissue 
appears  at  the  upper  region  of  the  pectoral  muscle,  the  upper  portion  of  the  pectoral  boundary  usually 
remains sharp after blurring and can be detected robustly by searching for the maximum horizontal
gradients  on  the  diffused  image.  The  extrapolation  of  the  detected  upper  pectoral  boundary  provides  a 
coarse  global  direction  of  the  pectoral  boundary.  To  explore  and  refine  the  entire  pectoral  boundary,  a 
gradient-based  directional  bandpass  (GDB)  filter  is  first  employed  to  enhance  the  linear  texture 
structures  on  the  mammogram.  A  gradient-based  texture  analysis  is  then  used  to  compute  an 
orientation  image  which  represents  the  dominant  texture  orientation  at  each  pixel.  The  orientation 
image  is  smoothed  using  an  edge  preserving  mean  shift  algorithm  that  iteratively  shifts  each  pixel  to 
the  average  of  the  pixels  in  its  neighborhood.  A  second  GDB  filter  is  applied  to  the  smoothed 
orientation  image  to  enhance  the  texture  patterns  that  have  high  similarity  in  the  dominant  texture 
orientations.  Candidate  edges  of  the  pectoral  muscle  are  detected  on  the  enhanced  orientation  image 
using  a  ridge-tracking  algorithm.  With  the  guidance  of  the  estimated  global  direction  of  the  pectoral 
boundary  and  the  anatomical  constraints,  an  edge  flow  propagation  algorithm  is  used  to  extract  the 
boundary  points  of  the  pectoral  muscle  by  pruning  the  edges  that  are  less  likely  to  lie  on  the  pectoral 
boundary. Second-order curve fitting is used to generate the final pectoral muscle boundary. A total of 118
MLO-view mammograms from 103 patients were randomly selected for the evaluation of the
algorithm  in  this  study.  The  mammograms  were  digitized  with  0.05  mm/pixel  resolution  and  reduced 
to  a  resolution  of  0.8  mm/pixel  for  pectoral  muscle  identification.  An  experienced  radiologist  used  a 
graphical  user  interface  to  manually  draw  the  pectoral  muscle  boundary  on  each  mammogram,  which 
was  then  used  to  define  the  pectoral  muscle  region  and  used  as  the  gold  standard  for  the  evaluation  of 
computer  performance. 
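
The search for the upper pectoral boundary by maximum horizontal gradient can be sketched as below. The sketch assumes the input is the already diffused image, uses a simple forward difference as the horizontal gradient, and takes its absolute value so that image polarity does not matter; these choices and the name `upper_boundary_candidates` are illustrative assumptions rather than the study's implementation.

```python
def upper_boundary_candidates(image, n_rows):
    """Return one (row, col) candidate per row for the top n_rows.

    For each row, col is the position of the largest absolute forward
    difference, i.e. the last pixel before the strongest horizontal step.
    """
    points = []
    for r in range(min(n_rows, len(image))):
        row = image[r]
        # Forward difference between horizontally adjacent pixels.
        grads = [abs(row[c + 1] - row[c]) for c in range(len(row) - 1)]
        best = max(range(len(grads)), key=grads.__getitem__)
        points.append((r, best))
    return points
```

A straight line fitted through these candidates and extrapolated downward would give the coarse global direction of the boundary mentioned in the text.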

RESULTS: For each MLO-view mammogram, the accuracy of pectoral boundary detection was
evaluated with two performance metrics: the percentage of the computer-detected pectoral muscle area
that overlapped with the gold standard, and the root-mean-square (RMS) distance between the
computer-identified pectoral boundary and the manually marked pectoral boundary. For the 118 MLO-view
mammograms, 99.15% (117/118) of the pectoral muscles could be identified. The average
percent overlap area was 85.9% with a standard deviation of 14.0%, and the average RMS distance was
4.31 mm with a standard deviation of 5.94 mm.


NEW WORK TO BE PRESENTED: Accurate identification of the pectoral muscle on
mammograms is challenging, especially for the improperly positioned MLO-view images and the
images containing dense glandular tissues overlapping with the pectoral muscle region. In this work,
we developed a two-stage gradient-based texture analysis method to detect the pectoral boundary. In
the first stage, linear texture structures were enhanced and the directional gradients were computed
using a directional bandpass filter. In the second stage, a texture orientation image was derived as the
dominant texture orientation at each pixel. A diffusion filter was used to estimate the global direction
of the pectoral boundary. An edge flow propagation method was developed to extract the pectoral
edges with the guidance of the estimated global direction.
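
One common way to obtain a dominant texture orientation from image gradients is the structure tensor, sketched below. This is a standard technique substituted for illustration and is not claimed to be the gradient-based texture analysis developed in this work; the window size and the name `dominant_orientation` are assumptions.

```python
import math


def dominant_orientation(image, r, c, half_window=2):
    """Dominant texture orientation in radians, in [0, pi), near (r, c).

    Gradients are accumulated into a 2x2 structure tensor over a square
    window. The texture orientation is taken perpendicular to the
    dominant gradient direction.
    """
    jxx = jyy = jxy = 0.0
    n_rows, n_cols = len(image), len(image[0])
    for i in range(max(1, r - half_window), min(n_rows - 1, r + half_window + 1)):
        for j in range(max(1, c - half_window), min(n_cols - 1, c + half_window + 1)):
            gx = (image[i][j + 1] - image[i][j - 1]) / 2.0  # central difference in x
            gy = (image[i + 1][j] - image[i - 1][j]) / 2.0  # central difference in y
            jxx += gx * gx
            jyy += gy * gy
            jxy += gx * gy
    # Orientation of the dominant gradient direction.
    grad_theta = 0.5 * math.atan2(2.0 * jxy, jxx - jyy)
    # Linear structures run perpendicular to the gradient.
    return (grad_theta + math.pi / 2.0) % math.pi
```

Angles are measured in image coordinates, with x along columns and y along rows, so an orientation of pi/2 corresponds to structures running along the row axis.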

CONCLUSION: The newly developed gradient-based directional filter and the dominant texture
orientation image estimation method can enhance the pectoral boundary regions. The edge flow
propagation method can accurately extract pectoral edges to generate the pectoral boundary. Automatic
pectoral muscle identification will provide the foundation for many image analysis tasks in CAD
applications.

PURPOSE:

We have previously developed an automatic mammographic density estimator (MDEST) for digitized film
mammograms (DFM). In this study, we modified MDEST to estimate breast density on full field digital mammograms
(FFDM) and further improved the performance of the MDEST on DFM.

METHOD  &  MATERIALS: 

The breast region is first extracted by breast boundary detection. The pectoral muscle is trimmed if the image is an MLO view.
An adaptive dynamic range reduction technique is used to reduce the gray level range in the low frequency
background. The breast image is classified into one of four classes ranging from fatty to very dense based on the
characteristics of its gray level histogram. For each class, an Expectation-Maximization (EM) algorithm is
developed to extract gray level features and a rule-based classifier is trained to segment the dense regions from the fatty
background. The parameters of the new rule-based method are trained separately for FFDMs and DFMs. The breast
density is estimated as the percentage of the segmented dense area relative to the breast area. Two-view FFDMs and the
corresponding DFMs from 99 patients with 202 images in each set were used as the test set. The computerized
segmentation on the two sets of mammograms is compared to the “gold standard,” which is obtained from interactive
thresholding segmentation averaged over 4 MQSA radiologists for each mammogram.
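
As an illustration of how an EM algorithm can summarize a gray level distribution, the sketch below fits a two-component one-dimensional Gaussian mixture, one component for fatty and one for dense tissue. The two-component model, the initialization, and the name `em_two_gaussians` are assumptions; the text does not specify the mixture model or the features actually extracted by MDEST.

```python
import math


def em_two_gaussians(gray_levels, n_iter=50):
    """Fit a two-component 1-D Gaussian mixture with EM.

    Returns [(weight, mean, variance), ...] sorted by mean.
    """
    lo, hi = min(gray_levels), max(gray_levels)
    if lo == hi:
        raise ValueError("gray levels are constant")
    # Assumed initialization: means at the quartile points of the range.
    means = [lo + 0.25 * (hi - lo), lo + 0.75 * (hi - lo)]
    variances = [((hi - lo) / 4.0) ** 2] * 2
    weights = [0.5, 0.5]
    n = len(gray_levels)
    for _ in range(n_iter):
        # E-step: responsibility of each component for each sample.
        resp = []
        for x in gray_levels:
            p = [w * math.exp(-(x - m) ** 2 / (2 * v)) / math.sqrt(2 * math.pi * v)
                 for w, m, v in zip(weights, means, variances)]
            total = (p[0] + p[1]) or 1e-300  # guard against underflow
            resp.append((p[0] / total, p[1] / total))
        # M-step: re-estimate weight, mean, and variance of each component.
        for k in range(2):
            nk = sum(r[k] for r in resp) or 1e-300
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, gray_levels)) / nk
            var = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, gray_levels)) / nk
            variances[k] = max(var, 1e-6)  # floor avoids a collapsed component
    return sorted(zip(weights, means, variances), key=lambda t: t[1])
```

The fitted means, variances, and mixing weights are examples of gray level features that a rule-based classifier could use when choosing a segmentation threshold.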

RESULT: 

For  FFDM,  the  correlation  between  the  computer  estimated  percent  dense  area  and  the  gold  standard  was  0.94  for  CC 
view,  0.92  for  MLO  view,  and  0.96  for  each  breast  with  the  percent  dense  area  estimated  as  the  average  of  two  views. 
The corresponding root-mean-square (RMS) error was 4.2%, 4.4%, and 3.5%, respectively. For DFM, the
corresponding  correlations  were  0.88,  0.86  and  0.92  with  RMS  error  of  7.0%,  7.1%  and  5.7%,  respectively. 
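
The agreement measures reported above can be computed as in the following sketch, which assumes the Pearson correlation coefficient (the text says only "correlation") and expresses percent dense area values in percentage points. The function names are hypothetical.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def rms_error(estimate, truth):
    """Root-mean-square difference between estimate and gold standard."""
    return math.sqrt(sum((e - t) ** 2 for e, t in zip(estimate, truth)) / len(truth))


def per_breast(cc, mlo):
    """Per-breast percent dense area as the average of the two views."""
    return [(a + b) / 2.0 for a, b in zip(cc, mlo)]
```

Averaging the CC and MLO estimates before comparison corresponds to the per-breast figures quoted in the text, where the correlation is higher and the RMS error lower than for either single view.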

CONCLUSION 

The  results  demonstrate  the  feasibility  of  estimating  breast  density  automatically  on  FFDMs  and  DFMs  using  the  same 
MDEST  system  by  only  incorporating  a  new  EM  estimation  step.  The  adaptability  of  the  new  EM  method  improved 
the  robustness  of  our  breast  density  estimation  technique  for  mammograms  acquired  with  different  imaging  systems. 


